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TRANSPOSON-INSULATOR ELEMENT DELIVERY SYSTEMS 

Cross-Reference to Related Applications 
This Application claims priority to Unites States provisional patent Serial No. 
60/440,125, filed January 15, 2003, which is hereby incorporated by reference herein. 

Field of the Invention 

Some embodiments of the invention are in the field of the delivery of exogenous nucleic 
acids to cells, especially by use of transposons that incorporate insulator elements. 

Background 

Vectors used to deliver genes into chromosomes of humans often integrate in a near 
random fashion into chromatin. The problem of insertional mutagenesis occurring as a result of 
random integration of a genetic construct has been a long-term problem that has been recognized 
for many years (e.g., see Verma and Somia, 1997). Recently, the most serious concerns about 
this problem were fully realized when human patients died as a result of gene therapy treatments 
that triggered unwanted transcription of the patients' genes. These events apparently happened 
because transcriptional regulatory elements can act over long distances in chromatin, up to 
nearly 10,000 base pairs. 

These events involved the use of retroviruses to correct deficiency of the common 
gamma chain in X-linked severe combined immunodeficiency (X-SCID). In these clinical trials, 
the retrovirus treatment resulted in a substantial therapeutic benefit to patients 21 ' 55 but 
unfortunately two of these patients have developed a leukemia-like syndrome. These adverse 
events are associated with integration of the retrovirus and activation of a nearby proto- 
oncogene, LM02. As a result, in a precautionary measure, the Food and Drug Administration 
(FDA) placed on "clinical hold" all active gene therapy trials using retroviral vectors to insert 
genes into blood stem cells. 
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Summary of Preferred Embodiments 
The long-felt need to solve the problem of unwanted activation of host genes urgently 
requires a solution if gene therapy is to realize its clinical promise. Although the introduction of 
an exogenous gene into a host holds great promise for scientific study and medical therapies, 
5 improved methods for delivering nucleic acids are desired. In particular, a method that provides 
greater control of the effects on the native genome are desirable; in particular, approaches that do 
not activate, or stimulate expression of native genes. An approach to solve to this long-felt but 
unsolved need is set forth herein, and includes use of new gene vectors and use of factors in the 
vectors that stop unwanted transcription. 
10 As described herein, insulating elements may be used to inhibit the unwanted 

transcription of host genes. Because neighboring genes in the host are individually regulated, 
there are borders that insulate the genes from one another. The borders are nucleic acid 
sequences referred to as insulating elements. This Application sets forth materials and methods 
for using the insulating elements in conjunction with transformation with transposons. Insulator 
15 elements appear to protect certain stretches of DNA from being transcriptionally silenced; for 
example, exogenous DNA has been introduced into a cell in conjunction with an insulator 
element to thereby prevent the transcriptional silencing of the exogenous DNA. It has not 
heretofore been appreciated, however, that exogenous insulating elements can be used to 
transcriptionally silence a gene of a host. 
20 An embodiment for using an insulator element is a transposon having at least one 

transcriptional unit and at least one insulator element. The transcriptional unit(s) may be flanked 
by at least one insulator element on each side. The transcriptional unit may include an 
exogenous nucleic acid for introduction into a cell, e.g., DNA encoding a marker molecule. 
The insulator element may include a binding site for a CTCF protein. The insulator element may 
25 also include, e.g., at least one of SEQ ID NO:16, SEQ ID NO:17, OR SEQ ID NO:18, or 
structural and/or functional equivalents thereof. A transcriptional unit may be disposed between 
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a first insulator element and a second insulator element, and the first insulator element and the 
second insulator element may be disposed between inverted repeats of a transposon. The 
exogenous nucleic acid may be, e.g., DNA encoding an antisense RNA or siRNA. 

Other embodiments are directed to methods. For example, an embodiment is a method of 
5 altering a cell, the method involving exposing a cell to a transposon that has a transcriptional unit 
and at least one insulator element. The transcriptional unit may be flanked by at least one 
insulator element on each side of the transcriptional unit. A transcriptional unit may include an 
exogenous nucleic acid for introduction into a cell. 

A transposon may be introduced into a cell as appropriate for the application, e.g., by 

10 electroporation or microinjection. Transposons with insulators may be introduced into any type 
of cell. Examples of such cells include lymphocytes, pancreatic cells, neural cells, muscle cells, 
blood cells, hepatocytes, hepatoma cells, primary hepatocytes, liver cells, stem cells, primary 
pancreatic cells, pancreatic stem cells, primary hematopoietic cells, and hematopoietic stem 
cells. Cells in vitro are contemplated. Cell sin living animals are also contemplated. 

15 Embodiments include introducing vectors into animals, including embryos, humans, humans, 
zebrafish, mice, and rats. 

Another embodiment is a transposon having a transcriptional unit and a means for 
preventing regulation of transcription of host nucleic acid by the transcriptional unit following 
insertion of into a host mammalian cell nuclear genome. Such means are set forth herein, as will 

20 be understood by persons of ordinary skill in these arts after reading this application. 

Brief Description of the Drawings 
Figure 1 A depicts a host chromosome with two silent genes; 

Figure IB depicts the mechanism of activation of host chromosome genes after insertion 
25 of a transcriptional unit that includes an exogenous gene with transcriptional enhancer 
sequences; 
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Figure 2 depicts the blocking of insertional mutagenesis by the use of at least one 
insulator element; 

Figure 3 depicts an embodiment for inhibiting insertional mutagenesis following random 
insertion of transgenes by means of a suicide sequence nucleic acid, also referred to herein as a 
fail-safe suicide vector; 

Figure 4 depicts an embodiment wherein a nucleic acid sequence encoding a transposase 
300 is introduced into a chromosome 100. The transposase is introduced with a suicide sequence 
nucleic acid 240, that can be triggered to destroy cells that incorporate the transposase and 
suicide sequence. 

Figure 5 depicts CTCF-enhancer-blocking motifs with respect to a transcriptional unit on 
a chromosome; 

Figure 6 represents the features of a test plasmid for use in testing of insulator elements 
using a transposon. 

Description of Preferred Embodiments of the Invention 
Chromatin Structure, Insulator/Border Elements and Gene Expression in Vertebrates 

The expression of subsets of genes in the various tissues of a vertebrate animal 
determines the function of that tissue. Although all intact genes in vertebrate genomes can be 
expressed, in any given cell of any given tissue, most genes are not expressed 39,62 . Maintaining 
some genes in an active state while other genes are kept silent is essential for the proper 
functioning of the organism as whole. This is not a trivial problem in vertebrate cells because 
the regulators of gene expression often are spread out over tens of thousands of base pairs in the 
DNA 31 . Consequently, there is much to discover about how regulators of gene expression are 
constrained to act only on specific genes, which are expressed as transcriptional units. It is clear 
that the structure of the chromatin plays a critical role in maintaining the active or silent states of 
genes in eukaryotic organisms ranging from yeast to mammals. The structure of the chromatin 
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depends on the presence, and interactions among, chromatin-binding proteins that regulate gene 
expression 116 ' 140 . The importance of chromatin structure for gene expression is also evident from 
genetic and transgenic studies 132 . 

Transgenic animals were one of the initial products of recombinant DNA-based 
5 biotechnology 56 ' 73 . The main idea was to introduce genes into animal genomes in order to direct 
the production of specific proteins. In order to achieve this end, it was important that the 
introduced genes would be expressed at reliable levels through multiple generations of the 
transgenic animal. The principle concern was that expression the transgenes not be shut down. 
Thus, transgenic animals have been very important tools for studying regulation of expression of 
10 newly introduced genes. 

C&-acting regulatory elements, the sequences of DNA to which chromatin-binding 
proteins attach to affect transcription of associated genes, have traditionally been studied using 
transient assays in which cultured cells are transfected or embryos are microinjected with DNA 
that remains unintegrated into chromosomes. However, when constructs containing all of the 
15 regulatory elements identified in transient expression studies are integrated into chromosomes of 
animals, the expression of these transgenes often is different from what is expected 151 . 

Many times the transgenes are either silent or expressed ectopically, that is, in tissues 
other than where expression was expected. Additionally, in animals that have multiple copies of 
a transgene in their genomes, overall expression levels of the transgenes are much lower than 
20 expected and not proportional to the their number in the chromosomes, which suggests that the 
genes are frequently turned off 151 . The dependence of expression of a transgene on the particular 
site of integration is presumably due to the influences of the neighboring chromatin 5,14 . This 
site-dependent variability of expression of a transgene is called position effect In contrast, 
transfers of genomic clones that contained genes flanked by extensive upstream and downstream 
25 regions many times resulted in position-independent expression of transgenes 3,54 ' 108 ' 138 . The 
position-independent expression of transgenes flanked by DNA sequences that extended well 
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beyond the identified transcriptional regulators indicated that in addition to the classical 
transcriptional regulators, known as enhancers and silencers, there are other types of DNA 
sequences that influence expression of genes integrated into chromosomes in an "all-or-none" 
manner. Transgenes are subject to equivalent position effects in all vertebrates, from zebrafish, a 
5 model system used by the inventor(s) to identify functions of genes and their regulatory units, to 
humans 1 ' 2 ' 30 ' 52 ' 87 . 

Organization of Genetic Material in the Cell Nucleus 

While not being limited to a particular theory of operation, some general principles 
provide a useful conceptual framework. Genomic DNA in diploid mammalian cells is about two 

10 meters long and is packaged into a roughly spherical nucleus with an average diameter of 10 
|Lim 32 . Therefore, chromatin has to be carefully compacted to fit into nuclei. The structure of 
chromatin is dynamic to support regulated expression of the tens of thousands of genes present in 
the DNA strands. Hence, interphase chromatin, the state of DNA with auxiliary proteins that 
exists when a cell is not replicating and dividing, should be organized so that transcriptional 

15 factors bound to enhancers and silencers interact only with the RNA polymerase complexes 
assembled at the promoters of the genes they are supposed to regulate. Cfo-acting regulatory 
elements can be located up to tens of thousands of base pairs upstream and downstream from the 
coding region of the gene they regulate 95 . When placed between two promoters enhancers can 
simultaneously activate transcription from both promoters 18 ' 128 . Therefore, mechanisms that 

20 prevent inappropriate promoter/enhancer interactions are necessary for the regulated expression 
of genes in eukaryotic cells. 

Furthermore, chromosomes appear to be divided into domains of gene expression that are 
separated by insulator elements. One or more genes, with their transcriptional regulatory 
elements, can reside in a single domain. Insulator elements are DNA sequences that are believed 

25 to function by binding specific proteins to prevent regulation of gene expression in one domain 
by the transcriptional regulators of neighboring domains. Chromatin domains are probably 
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either structural units helping to package DNA inside the nucleus, functional units that allow 
gene expression independent of the chromatin structure in the neighboring chromatin, or 
both 50 ' 127 . Thus, it is inferred that border elements are involved in regulated gene expression, as 
indicated by the results of genetic, biochemical and cytological examinations of chromatin 
5 structure and gene expression. Certain embodiments in this Application take advantage of these 
border element (also referred to herein as insulator elements) properties to make improved 
vectors for gene therapy and as research tools. 

Genetic Evidence for Chromatin Domains » 
In addition to the evidence derived from cell organization data, there is genetic evidence 

10 that supports the importance of the chromosomal environment for gene expression. Studies in 
yeast, mammalian cells insect cells, and cells of other animals and plants show the existence of 
chromatin domains. For example, when the white gene, which normally resides in the 
euchromatin (portions of a chromosome in which genes can be expressed compared to 
heterochromatin that is comprised of DNA sequences essential for chromosome replication and 

1 5 segregation during cell division but which cannot support transcription of genes) translocates to a 
position near centromeric heterochromatin, its expression pattern is altered resulting in eyes with 
red or yellow patches 122 . This is called position effect variegation (PEV). PEV results from the 
spreading of heterochromatin into the translocated euchromatic gene in a portion of cells early in 
development 39 . The heterochromatic or euchromatic state of the translocated gene is inherited in 

20 progenitor cells, leading to variegated eye-color phenotype. A similar phenomenon has been 
observed in the yeast, S. cerevisiae. Genes that were experimentally introduced near yeast 
telomeres are subject to repression by the telomeric heterochromatin 116 . Expression of ADE2 
gene at its normal chromosomal position gives rise to white yeast colonies whereas mactivation 
of ADE2 gene results in red colonies. ADE2 gene, placed near telomeres, is expressed in some 

25 yeast cells while telomeric heterochromatin silences ADE2 gene in other cells 53 . The silent or 
expressed state of ADE2 is inherited in progenitor cells giving rise to sectored, red and white, 
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colonies. Repression of genes by telomeric heterochromatin is called "telomeric silencing". 

121 

Genes located up to four kilobases away from telomeres can be subject to telomeric silencing . 
Sectored colonies in yeast and variegated eye-color in Drosophila suggest that the silent or 
expressed state of genes near heterochromatin, once established, can be maintained through 
5 many rounds of cell divisions. PEV and telomeric silencing support the hypothesis that the 
chromosomal environment plays an important role for maintenance of the expressed or the silent 
state of genes and that there are boundaries which prevent spreading of the heterochromatin into 
the euchromatic regions of genomes. 

Genetic evidence for the existence of DNA sequences which prevent promoter/enhancer 

10 interactions comes from mutant phenotypes caused by insertions of gypsy retrotransposons 
between c/s-acting regulatory elements and promoters of genes such as yellow, Ultrabithorax 
and cut 61 ' 98,111 . The mutant phenotypes can be reverted either by excision of the retrotransposon 
or by mutations in the su(Hw) gene. The portion of the gypsy transposon responsible for mutant 
phenotypes is an array of binding sites for Su(Hw) protein 112,131 . The Su(Hw) protein also binds 

15 to sites in approximately 200 bands of polytene chromosomes that do not contain gypsy 
transposons 60,110 . All of this indicates that there are DNA elements that bind specific proteins to 
insulate genes from activation or repression by neighboring transcriptional regulatory elements. 
Cytological and Biochemical Evidence for Chromatin Domain - MARs 

A significant body of research has been devoted to candidate matrix attachment regions 

20 (MARs). This research, when viewed in light of recent data about insulator elements, provides 
further insight into insulator element structure, function, and activity. A brief overview of this 
body of research, starting at the level of chromatin organization, is helpful for placing insights 
into insulator elements in context. 

Chromatin is structured with several levels of organization. The first level of 

25 organization is the winding of DNA around core histones to yield nucleosomes 79 . Nucleosomes 
coil into a solenoid structure to form what is called the 30 nm fiber that consists of six 
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nueleosomes per turn of the coil. 30 mn fibers appear to have a ribbon-like, zigzag organization 
with linker DNA crisscrossing the axis of chromatin fiber 153 ' 154 . 30 nm fibers are coiled into 
higher order coils to give greater compaction of the DNA. The highest levels of organization can 
be seen in certain types of chromosomal structures called lampbrush chromosomes that were 
5 observed in early cytological studies of amphibian oocytes and polytene chromosomes in insects. 
Loops of chromatin of uniform thickness appear to be anchored to a protein complex that runs 
along the axis of chromosome 127 . Transcribed genes, marked by nascent mRNA strands in 
electron micrographs, reside in the loops. In polytene chromosomes from fruit flies, which have 
reproducible pattern of bands that are thought to result from different degrees of chromatin 

10 compaction, transcriptional activation is accompanied by expansion of these bands into "puffs" 
that may reflect myriads of RNA tendrils emerging from the DNA templates. Such cytological 
studies suggested that chromosomes are subdivided into a series of domains, manifested as loops 
anchored to a network of proteins called either the nuclear matrix, the nuclear scaffold or the 
nucleoskeleton 114 . The nucleoskeleton extends throughout the nucleus 64 . In one model derived 

15 from identifying sites of RNA polymerase II binding to chromatin, the activation of genes may 
determine chromatin organization 129 . In an alternative model, special DNA sequences demarcate 
boundaries between loops. These sequences are presumably bound by nuclear matrix proteins 
that attach the loops to the matrix. 

Several biochemical methods have been developed to isolate nuclear matrix proteins and 

20 identify the DNA sequences that bind nuclear matrix. Nuclei extracted with lithium 
diiodosalicilate to remove histones and the majority of other nuclear proteins produce DNA and 

96 

what is called the nuclear scaffold . Chromosomal DNA sequences to which the proteins from 
nuclear scaffolds can bind are called scaffold attachment regions (SARs). Alternatively, another 
method of chromatin extraction using 2M NaCl yields what is called the nuclear matrix 24,23 . 
25 Proteins associated with the nuclear matrix can be used to test other DNA sequences for binding 
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in vitro to identify candidate matrix attachment regions (MARs). One might expect that SAR- 
binding proteins and MAR-binding proteins should be the same but few tests for identity of the 
two types of protein have been done, in part because both treatments are sufficiently harsh that 
there are some doubts about the integrity of the isolated matrix and scaffold proteins - both 
5 methods yield poorly defined mixtures of proteins that vary with modifications of the isolation 
protocols 7 ' 38 . Proteins identified as components of nuclear matrix or scaffold include 
topoisomerase II 10 , nuclear matrins 6 ' 58 ' 104 SATB1 from thymus cells 29 ' 34 ' 103 , lamins A and 
B9 0 ' 91 , SAF-A from HeLa nuclei 125 , attachment region-binding protein (ARBP) 144 ' 145 and 
pl20/hnRNP 146 . ARBP binds to specific sequence motif in 5 5 -MAR from the chicken 
10 lysozyme gene 144 ' 145 . The SAF-A and SATB1 proteins can bind cooperatively to many different 
MARs 34 ' 103,125 . MARs have been isolated from many organisms, ranging from yeast to 

humans 17 ' 26 . Since MARs reside between genes as well as near transcriptional regulatory 
elements, it has been hypothesized that different classes of MARs have different roles in gene 
expression 82 : MARs located between genes are thought to act as border elements while MARs 
15 which reside near enhancers and silencers are proposed to participate in regulation of 

transcription^'^ 9 . 

The sequences of MARs proximal to different genes are not identical. The sequences of 
some MARs are AT-rich, implying specific conformational properties of the double helix such 
as curvature and different sizes of the major and minor grooves. Other MARs appear to share 
20 sequence motifs, such as ATTA and ATTTA, that are so short as to be highly distributed (e.g. 
every few hundred basepairs of DNA on average) even though they contain recognition 
sequences for topoisomerase II, one of the proteins associated with MARS. Thus, some matrix 
protein preparations may be contaminated with polypeptides that recognize general 

conformational features of DNA rather than any specific sequence motifs 17 . This may explain 
25 the lack of sequence similarity among MARs, the variations in protein composition of various 
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MAR and SAR preparations, and thus clouds conclusions of various reported interactions 
between MARs, which we henceforth use to include SARs, and matrix proteins in vivo. 

Gentler methods for isolating chromosomal structures have been achieved by 

encapsulating cells into agarose beads followed by lysis with physiological buffers > . 

5 Cytoplasmic components and chromatin can be removed by electroelution to leave a structure 
called the nucleoskeleton. Major components of the nucleoskeleton are lamins 64 . Functional 
RNA polymerases may be attached to the nucleoskeleton 69 ' 71 ' 117 . In these preparations, 
multiple RNA polymerases appear to be clustered in foci where transcription takes place 
suggesting that these nuclear structures reflect their natural states. Observations such as these 

10 lead to a model in which the function of the nucleoskeleton is similar to the function of the 
cytoskeleton in the cytoplasm 28 ' 70 . In addition, transcriptional factories are attached to the 
nucleoskeleton. Templates are somehow brought to the transcriptional factories and threaded 
along them during transcription. According to this model the chromatin loops are formed by 
virtue of template attachment to transcriptional factories. Therefore, chromatin loops are 

15 dynamic, rather than static structures 129 . This model does not address the problem of 

inappropriate promoter/enhancer interactions. 

Biochemical evidence supports the cytological evidence for division of chromatin into 

transcriptionally active and inactive domains. DNase and restriction endonuclease sensitivity 

studies used to probe modifications of nucleosomes in genes that are actively transcribed provide 
20 additional biochemical evidence for the existence of chromatin domains. Genes that are actively 

transcribed or poised for transcription reside in chromatin that is more susceptible to digestion 

with nucleases than chromatin containing genes that are not expressed 148 . Regions of DNase- 
sensitivity encompass coding regions of genes and also extend into upstream and downstream 
flanking regions and may coincide with the location of MARs 11,8 ^' 88 . The explanation for this 
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is obvious, chromatin fibers that are being transcribed by RNA polymerases are expected to be 
less encumbered with chromosomal packaging proteins and therefore less condensed, allowing 
easy access of DNA binding proteins to transcriptional regulatory elements as well as nucleases. 
Nucleosomes in actively transcribed genes are usually hyperacetylated and have other 
5 modifications to their histones relative to genes that are not transcribed 33 ' 106 ' 123 ' 134 . Loss of 
transcriptional activity of a transgene is accompanied by DNA methylation and histone 
deacetylation and is prevented by insulators 115 ' 120 . These two biochemical lines of evidence 
support the cytological studies that show that the eukaryotic genomes are divided into functional 
domains. For the |u immunoglobulin gene, nuclear matrix attachment regions antagonize 

10 methylation-dependent repression of long-range enhancer-promoter interactions 43 ' 44 . The role of 
DNA sequences in organizing chromatin can be tested as well as applied in transgenic animals, 
as described next. 
MAR-Type Insulator Elements 

All of this information provides evidence that insulator elements play a significant role in 

15 MARs. Transgenic animals have been used to test the ability of MARs to function as insulator 
elements. MARs appear to be able to ensure proper expression of transgenes in terms of tissue- 
specificity and timing. The lysozyme gene is expressed in macrophages and in hen 

oviducts 13 ' 14 . In these two tissues, the chicken lysozyme gene resides in a region of open 
chromatin that extends 7 kb upstream and 4 kb downstream from the coding region of the 

20 gene 88 . At the edges of the open chromatin region are MARs that are postulated to separate the 
chicken lysozyme domain from neighboring domains. The MAR located upstream of the 
lysozyme gene is called the A-element Between the A-element and the lysozyme gene are 
enhancers and silencers that regulate lysozyme expression in the oviduct and in myeloid 
tissues 13 . The A-elements allow expected patterns of expression of transgenes in tissue culture 

25 and in mice, although position effects are sometimes observed 13 ' 14 ' 15 ' 133 . There is a 
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significant body of research that describes the functional, structural, and detailed features of A- 
elements so that a person of ordinary skill in these arts is able to identify A-elements. In 
combination with the disclosures of this Application related to insulator elements, therefore, a 
person of ordinary skill in these arts can make, use, and identify A-type insulator elements. 
5 Although the MARs from the chicken lysozyme gene enhance the expression of 

transgenes integrated into chromosomes, they have no effect in transient expression assays in 
which the genes are expressed from plasmids rather than chromosomes 1 1 ° > 1 0 . When A- 
element and 3 '-MAR sequences were removed from the transgenic constructs, the expression 
levels of the lysozyme gene remained proportional to the number of transgenes integrated into 

10 chromosomes but the transgenes were expressed ectopically 16 . Conversely, when parts of 5- 
flanking region were removed from the transgenic constructs, A-element and 3 '-MAR sequences 
prevented ectopic expression of the lysozyme transgene but the levels of expression were not 

copy number-dependent 1 u . 

The A/T-rich binding protein ARBP is an abundant nuclear protein with high affinity for 
1 5 MAR/SARs. ARBP is homologous to rat methyl-CpG-binding protein MeCP2 and may function 

149 

in defining chromatin loops and domains bordered by MAR sequences 

Similarly, A-elements were placed upstream and downstream of the whey acidic protein 
(WAP) gene and introduced in mice. The temporal and spatial specificity of expression of the 
WAP transgene, flanked by A-elements, corresponded to that of the endogenous WAP gene, 

20 whereas transgene expression was disregulated in the absence of A-elements 92?93 . However, as 
with the lysozyme gene, the expression levels of WAP transgenes flanked by A-elements were 
not proportional to the number of transgenes integrated into chromosomes. MARs flanking the 
mammalian alipoprotein B gene have been used to insulate mammalian reporter genes in 
mammalian cells 74 and when flanking the white gene of Drosophila were able to confer position- 

25 independent expression in flies, demonstrating that mammalian MARs were functional in insect 
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cells 105 . These results indicate that MARs can protect transgenes from the influences of 
neighboring enhancers and silencers but that additional regulatory elements need to be present 
for copy number-dependent transgene expression. 

MAR-like sequences have been identified at the boundaries of the DNase-sensitive 
5 region of the human p-interferon (p -IFN) gene. When a HSP70.1 promoter-luciferase gene 
construct was flanked by 0 -IFN gene MAR sequences, the expression profiles were as expected 

but the levels were not copy number-dependent 139 . These results support the idea that MARs 
can prevent interactions of promoters from transgenes with the enhancers and silencers from 
neighboring genes. Thus, either with their normal genes or with heterologous genes, MARs act 
10 as neutral boundaries that do not alter the expression pattern of genes they flank, but they do not 
seem to protect genes against repression when integrated into some regions of 

chromosomes 92 ' 97 ' 139 . Thus, it is known that some MARs, and A-elements, contain insulator 
elements that satisfy the test criteria for an insulator element, as set forth herein. As such, a 
person of ordinary skill in these arts will immediately be able to recognize such insulator 

15 elements and, after reading this Application, will be able use them with embodiments of the 
inventions as set forth herein. 
Border-Type Insulator Elements 

Some researchers have reported on border-type insulator elements that appear to function 
by a different mechanism than do MARs. Some MARs, however, apparently contain insulator 

20 elements. Therefore the different mechanisms attributed to MARs as compared to insulator 
elements may be a result of DNA sequences in the MARs associated with structural functions 
rather than strictly biochemical functions as well as functions that govern DNA replication as 
well as transcriptional regulation, which is the responsibility of insulator motifs. 

The scs/scs 5 -elements from the 87A7 heat shock locus, su(Hw) binding region from the 

25 gypsy retrotransposon and the DHS5 site from the chicken p-globin locus are DNA sequences 
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capable of alleviating position effects in transgenic fruit flies 23 ' 76,126,141 . The 87A7 locus spans 
14 kb and contains two hsp70 genes, transcribed in opposite directions, that are flanked by 
special chromatin sequences (scs/scs 9 ) elements. Scs and scs 9 elements each have a nuclease- 
resistant core flanked by DNase-hypersensitivity (DHS) sites that correlate with the boundaries 

5 of transcriptional regions and therefore with the sites that are likely to harbor border element 
sequences 12,35,65 ' 83 ' 84,148 . The position of DHS sites within the scs/scs '-elements changes upon 
heat shock induction of hsp70 genes, suggesting that chromatin domains may be structure-based 
rather than sequence-based 142 ' 143 . Both DHS sequences harbor topoisomerase II-recognition 
sequences, are under torsional strain in vivo 144 , and both bind proteins in a sequence-specific 

10 manner. The scs element binds the protein bangdoo 50 . Scs' element binds the protein complex 
called BEAF (border element-associated factor) that consists of alternatively spliced forms of 

BEAF32 protein 61 ' 157 . Antibodies raised against BEAF32 bind to about 100-150 bands on 
polytene chromosomes 157 , suggesting the presence of insulators throughout the Drosophila 
genome. Another scs-binding protein, zeste-white-5 (Zw5), is able to block enhancer-promoter 

15 interaction by binding to a specific 24-bp sequence within the scs element 46 . Thus, the scs/scs 9 
insulators appear to act in concert with specific DNA-binding proteins. 

In transgenic fruit flies, scs/scs '-elements can insulate the white gene, driven by a 
minimal promoter, from activation by neighboring enhancers. This resulted in flies with pale 
yellow eyes 76 . All transgenic flies had bright red eyes when scs/scs '-elements flanked a white 

20 gene equipped with all its regulatory elements, indicating that scs/scs '-elements protected the 
transgene from repression by silencers. The ability of scs/scs -elements to prevent interactions 
between transcriptional regulatory elements and promoters was tested further in enhancer- 
blocking assays. Both elements prevented activation of transcription of transgenes when placed 
between a wide variety of enhancers and promoters 77 and in a multiplicity of chromosomal 

25 contexts, even in amphibian and zebrafish cells 19 ' 20 ' 37,80 . 
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Protein-binding ability led to the identification of another insulating sequence, that in the 
gypsy transposable element of Drosophila. A DNA sequence in gypsy retrotransposons, which 
contains twelve binding sites for the Su(Hw) protein, was identified as an insulator because it 
can prevent the regulation of transcription of genes when the gypsy transposon lands between 
5 their promoter and czs-acting sequences 67 ' 98,111 ' 126 . Other polypeptides like the mdg4 proteins 
may interact with su(Hw) to establish insulation 45 ' 49 ' 51 . The su(Hw)-binding region protects 
transgenes from position effects and has the ability to block interactions between many different 
enhancers and promoters 18 ' 76 ' 78 ' 126 . Moreover, the gypsy insulator determines the nuclear 
localization of DNA; DNA sequences flanked by Su(Hw)~elements tended to be on the nuclear 
10 periphery introduced into Drosophila diploid imaginal disk cells 48 . Thus, this activity correlates 
with cytological observations noted earlier. Surprisingly, insulation by Su(Hw)-binding sites is 
sensitive to the number of insulators. Two tandem copies of insulator Su(Hw) were ineffective 
in blocking various enhancers from activating a downstream promoter, a single enhancer was 
partially effective in blocking enhancer activation when placed between the promoter and 
15 enhancer, and flanking an enhancer with single insulators on each side was essentially 
completely effective in blocking its activity 18 ' 18 *' 99 ' 101 . These data do not suggest that these 
insulating elements work by simple structural rearrangement of chromatin 47 nor by some sort of 
simple decoy mechanism 50 . Rather, the data suggest that the border elements work in pairs and 
can cancel their boundary effects when placed close to one another, perhaps by a mechanism 
20 similar to that which occurs with transfection 100 . 

The DNA fragment which contains 5 ? constitutive DHS from the chicken p-globin locus 
control region (LCR) also insulates the white gene from position effects in transgenic fruit 
flies 82 . The Felsenfeld laboratory 8 ' 9 ' 22 ' 23 ' 41 ' 102 ' 119 ' 150 and others 85 ' 86 ' 147 ' 155 have conducted detailed 
molecular studies of this locus. They have found that the gamma and beta-globin loci contain 
25 both properties associated with border elements and insulator elements. The first are barrier 
motifs that serve to block heterochromatization by spread of methylation that can permanently 
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shutdown expression of transgenes. The second property is enhancer-blocking activity mediated 
by a CTCF-binding site such that enhancers neighboring a transgenic construct will not influence 
the activity of the transgene 8 ' 9 ' 81,150 . CTCF-binding sites have been identified in other boundary 
elements 42 ' 75 . 

5 Other examples of insulator elements include the sns insulator from the sea urchin 

arylsulfatase gene 4 ' 36 , sequences flanking the early H2A histone gene in sea urchins 94,107 , the 
Fab-7 sequence in the Drosophila bithorax complex that insulates the iah-7 gene 5,57 ' 158 , the MAR 
sequences flanking the tyrosinase gene in mammals 118 , and the methylation-regulated 63 insulator 
sequences found in the H19/Igf2 locus 59,75 . 
10 In partial summary of information about insulator elements, as stated above, 

chromosomes appear to be divided into domains that contain either expressed or silent 
genes 127,152 . These domains are presumably separated by DNA sequences that have a variety of 
names including insulator sequences and border elements 121 . As used herein, however, the term 
"insulator element" encompasses nucleic acid sequences that have been referred to as border 
15 elements as well as insulator elements. Insulator elements appear to be able to prevent influences 
of neighboring chromatin on genes within a chromatin domain 127 . Insulator elements can be 
identified as DNA sequences capable of alleviating position effects in transgenic animals. In this 
application, an insulator element refers to a DNA sequence that will block transcriptional 
regulatory sequences in a transgenic construct from acting on neighboring genes after the 
20 integration of the transgenic construct into a chromosome. Detailed test protocols for an 
insulator element are provided below. Certain embodiments are insulator elements have a size in 
the range of about 10 to about 2500 base pairs; a person of ordinary skill will realize that all 
ranges and values within the range specifically set forth are contemplated and included herein, 
e.g., about 20 to about 1250 base pairs, less than about 1000 base pairs, and about 20 to about 
25 300 base pairs. Further, some insulator elements are known to comprise certain regions for 
binding factors to regulate transcription; accordingly, some embodiments are directed to those 
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motifs, or essentially those motifs. 

As set forth above, insulator elements have some common features. First, they are 
typically active only when part of chromatin. Second, insulator elements do not alter the 
expression levels of genes in transient assays. Third, neither MARs nor other insulator element 
5 types alter tissue-specificity of transgene expression. Essentially MARs appear to block 
enhancer activity, but do not have silencing-blocking activity. Many insulators have both. 
Because insulator elements have a role in establishing domains of open chromatin characterized 
by global changes in histone modifications, they have been employed in transgenic studies to 
protect transgenes on viral vectors from being silenced and/or from their expression being 

1 0 influenced by neighboring transcriptional regulatory sequences. 
Insulator Elements In The Patent Literature 

Some reports of insulators are set forth in the patent literature. For example, US 
6,395,549, entitled "Long terminal repeat, enhancer, and insulator sequences for use in 
recombinant vectors", hereby incorporated by reference, describes use of an insulator element 

15 with a viral construct to control expression of exogenous genes introduced into a cell. And, for 
example, US 5,731,178, entitled " Attachment-elements for stimulation of eukaryotic expression 
system" describes use of an insulator element, i.e., an attachment element, with a viral 
transfection construct. Similarly, other reports include US 6,100,448, entitled "Increasing 
expression of transgenes in plant cells using insulator elements" and US 5,610,053, entitled 

20 "DNA sequence which acts as a chromatin insulator element to protect expressed genes from 
cis-acting regulatory sequences in mammalian cells." 

No one heretofore, however, is believed to have proposed using insulator elements to 
protect resident chromosomal genes from activation by inserted exogenous and/or transgenic 
constructs. Indeed, these applications, above, are generally limited to the use of insulator 

25 elements to prevent repression of an exogenous gene in a host cell. In contrast, some 
embodiments set forth herein are directed to insulator elements for preventing transcription of 
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native genes when an exogenous gene is introduced into a host cell. The unwanted transcription 
of native genes has been a long-felt need that has not been addressed. The use of insulator 
sequences as described herein, however, addresses this problem. One reason that insulator 
sequences have not been previously proposed as a solution to this problem is that insulator 
5 sequences are commonly understood as promoting the transcription of the introduced exogenous 
sequence. In contrast, as set forth herein, and without being limited to a particular theory of 
operation, it is understood that insulator sequences serve to create domains to which transcription 
factors are isolated. As a result, enhancers and/or promoters inserted into a host chromosome do 
not activate the transcription of nucleic acids outside of the domain. Thus, insulator elements 

10 can be used to isolate exogenous transcription sequences from native genes, and prevent the 
transcription of host nucleic acids. 

Moreover, these patent applications, set forth above, are generally limited to the use of 
viral vectors. Certain embodiments herein, however, are directed to the use of transposon 
vectors. There are differences between transposonal vectors and viral vectors that would not 

15 lead a person of ordinary skill in these arts to substitute one for the other with respect to the use 
of an insulator element. For many applications involving genetic transfection, e.g., gene therapy, 
long-term expression of genes is essential. Transcriptional silencing of genes introduced by 
retroviruses poses a major obstacle to their use as gene therapy vectors, and border elements 
have been proposed as a solution 109 . Consequently, a number of investigators have incorporated 

20 border elements into their constructs to block methylation of the therapeutic gene 40,66 ' 124 ' 135 . 
Hackett has gone further to demonstrate, using zebra fish and viral transfections as a model 
system, that border elements can maintain reproducible levels of transgene expression in 
multiple generations. 19,20 . However, none of these studies have addressed the blocking of 
enhancer effects of the transgenic constructs on endogenous chromosomal genes. Nor do these 

25 studies address introducing insulator elements with transposons. Transposons, in fact, are not 
subject to the same transcriptional silencing challenges that plague the viral vectors. 
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Insulator Elements for Use with Vectors 

Figure 1 depicts the phenomenon of insertional mutagenesis following random insertion 
of transgenes. The depiction Figure 1 is simplified by neglecting the effects of three- 
dimensional structures and corresponding proteins and is a guide to the relevant issues at the 

5 level of DNA. Chromosome 100 having genes 110, 120 and the genes having promoters 112, 
122 and enhancers 114, 124 is depicted in Figure 1A, wherein genes 110, 120 are 
transcriptionally silent. Insertion of an exogenous therapeutic or marker sequence 130, along 
with promoter 132 and enhancer 134 may activate the nearby genes 110 or 120 via the enhancer 
134, with the solid line 140 indicating a strong activation and the dotted line 150 indicating a 

10 weaker or nonexistent activation. Arrows 160, 170, 180 show expression of the genes, with the 
weight and length of the arrows indicating potential expression levels; with arrow 170 indicating 
the most expression, arrow 160 indicating lesser expression and arrow 180 indicating the least 
potential expression. 

Figure 2 depicts the blocking of insertional mutagenesis by the use of at least one 
15 insulator element. The inserted nucleic acid sequence 130 is flanked by insulator elements 180, 
190, which are flanked by transposon inverted terminal repeat sequences 200, 210. The insulator 
elements block paths 140, 150, as indicated by Xs 220, 230. The transcription of exogenous 
nucleic acid sequence 130 is allowed but the expression of genes 110 and 120 is prevented. 
Alternatively, only one insulator element could be used, with that element preferably being 
20 placed as shown for element 1 80. The inverted terminal repeats of the transposon show how the 
vector can be designed for use with a transposon-based vector system. 

Figure 3 depicts an embodiment for inhibiting insertional mutagenesis following random 
insertion of transgenes by means of a suicide sequence nucleic acid, also referred to herein as a 
fail-safe suicide vector. As set forth in Figures 1 and 2, an exogenous gene 130 may be 
25 introduced into chromosome 100 using a transposon. A suicide sequence nucleic acid, e.g., a 
suicide gene, may also be introduced with gene 130. The suicide sequence 240 may be placed 
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near gene 130, or further away, as shown in Figure 3B. A promoter 242 may be incorporated 
into the chromosome 10 in a position to promote the suicide sequence 240, in which case 
expression of the suicide sequence would be increased, as indicated by arrow 250. The suicide 
sequence enables a triggering event to be used to kill cells transected with the sequence. For 
5 example, a substrate molecule may be supplied as a triggering event. 

Figure 4 depicts an embodiment wherein a nucleic acid sequence encoding a transposase 
300 is introduced into a chromosome 100. The transposase is introduced with a suicide sequence 
nucleic acid 240, which can be triggered to destroy cells that incorporate the transposase and 
suicide sequence. 
10 Vectors 

Nucleic acids can be incorporated into vectors. As used herein, a vector is a replicon, 
e.g., a plasmid, phage, or cosmid, into which another nucleic acid segment may be inserted so as 
to bring about replication of the inserted segment. Vectors may be expression vectors containing 
an inserted nucleic acid segment that is operably linked to expression control sequences. An 

15 expression vector is a vector that includes one or more expression control sequences, and an 
expression control sequence is a DNA sequence that controls and regulates the transcription 
and/or translation of another DNA sequence. Expression control sequences include, for 
example, promoter sequences, transcriptional enhancer elements, and any other nucleic acid 
elements required for RNA polymerase binding, initiation, or termination of transcription. With 

20 respect to expression control sequences, the term operably linked means that the expression 
control sequence and the inserted nucleic acid sequence of interest (also referred to herein as the 
exogenous nucleic acid sequence that is intended to be eicpressed, also referred to as the 
exogenous nucleic acid sequence) are positioned such that the inserted sequence is transcribed 
(e.g., when the vector is introduced into a host cell). A transcriptional unit in a vector may thus 

25 comprise an expression control sequence operably linked to an exogenous nucleic acid sequence. 
For example, a DNA sequence is operably linked to an expression-control sequence, such as a 
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promoter when the expression control sequence controls and regulates the transcription and 
translation of that DNA sequence. The term "operably linked" includes having an appropriate 
start signal {e.g., ATG) in front of the DNA sequence to be expressed and maintaining the 
correct reading frame to permit expression of the DNA sequence under the control of the 

5 expression control sequence to yield production of tire desired protein product. Examples of 
vectors include: plasmids, adenovirus, Adeno-Associated Virus (AAV), Lentivirus (FIV), 
Retrovirus (MoMLV), and transposons. 

There are a variety of promoters that could be used including, e.g., constitutive 
promoters, tissue-specific promoters, and inducible promoters. Promoters are regulatory signals 

10 that bind RNA polymerase in a cell to initiate transcription of a downstream (3 '-direction) 
coding sequence. 

Many different types of vectors are known. For example, plasmid vectors and viral 
vectors, e.g., retroviral vectors, are known. Mammalian plasmid expression vectors typically 
have an origin of replication, a suitable promoter and optional enhancer, and also any necessary 

15 ribosome binding sites, a polyadenylation site, splice donor and acceptor sites, transcriptional 
termination sequences, and 5' flanking nontranscribed sequences. In addition, the expression 
vectors preferably contain a gene to provide a phenotypic trait for selection of transformed host 
cells such as neomycin resistance for eukaryotic cell culture, or tetracycline or ampicillin 
resistance in E. coli. Retroviral vectors, which typically transduce only dividing cells, can be 

20 used. Adenoviral vectors, capable of delivering DNA to quiescent cells can be used. Another 
viral vector system with potential advantages is an adeno-associated viral vector. 
Transposonal Vectors 

Nature uses two devices for introducing new genetic material into chromosomes of 
eukaryotic organisms. The first is viruses, against which most animals have defensive systems 
25 to protect their chromosomes from outside intruders. The second method used over long 
evolutionary periods (often millions of years) is to use transposons, which will enzymatically 
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insert appropriate sequences of DNA into cellular chromosomes. However, merely introducing a 
gene into the genome of an animal is of little effect unless that gene is expressed to make a 
protein or confer some new property or function to the cell. So, it is useful to introduce 
transgenes into chromosomes in such a fashion that they can be expressed following delivery. 
5 Very often genes introduced by viruses are switched off soon after they enter chromosomes. 
This is a defense animals have against viruses. However, the Sleeping Beauty System is a non- 
viral method for delivery of individual genes into animal chromosomes. These newly introduced 
genes thus can be maintained and expressed over very long periods of time and can be passed 
from one cell generation to the next without loss of expression. Thus, by employing transposons 

10 and avoiding viruses, one has the ability to direct the integration of a wide variety of transgenes 
(foreign genes) into chromosomes of various vertebrate animals. 

Transposons or transposable elements include a short piece of nucleic acid bounded by 
repeat sequences. Active transposons encode enzymes that facilitate the insertion of the nucleic 
acid into DNA sequences. In vertebrates, the discovery of DNA-transposons, mobile elements 

15 that move via a DNA intermediate, is relatively recent. Since then, inactive, highly mutated 
members of the Tel /mariner as well as the hAT (hobo/Ac/Tam) superfamilies of eukaryotic 
transposons have been isolated from different fish species, Xenopus and human genomes. These 
transposable elements transpose through a cut-and-paste mechanism; the element-encoded 
transposase catalyzes the excision of the transposon from its original location and promotes its 

20 reintegration elsewhere in the genome. Autonomous members of a transposon family can 
express an active transposase, the £ra/is-acting factor for transposition, and thus are capable of 
transposing on their own. Nonautonomous elements have mutated transposase genes but may 
retain cis-acting DNA sequences. These cis-acting DNA sequences are also referred to as 
inverted terminal repeats. Some inverted repeat sequences include one or more direct repeat 

25 sequences. These sequences usually are embedded in the terminal inverted repeats (IRs) of the 
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elements, which are required for mobilization in the presence of a complementary transposase 
from another element. 

A transposase is an enzyme that is capable of binding to DNA at sequences termed 
inverted terminal repeats. Transposons typically contain at least one, and preferably two, 
5 inverted repeats that flank an intervening nucleic acid sequence. The transposase binds to 
recognition sites in the inverted repeats and catalyzes the incorporation of the transposon into 
DNA. Transposons can be mobile, in that they can move from one position on DNA to a second 
position on the same or a different DNA molecule in the presence of a transposase. There are 
typically two components of a mobile cut-and-paste type transposon system, a source of an 

10 active transposase, and the DNA sequences that are recognized and mobilized by the 
transposase. Mobilization of the DNA sequences permits the intervening nucleic acid between 
the recognized DNA sequences to also be mobilized. 

Cells that may be exposed to, or transfected by, transposons can be obtained from a 
variety of sources including bacteria, fungi, plants and animals, e.g., a vertebrate or an 

15 invertebrate; for example, crustaceans, mollusks, fish, birds, mammals, rodents, ungulates, 
sheep, swine and humans. Cells that may be exposed to a transposon include, e.g., lymphocytes, 
hepatocytes, neural cells, muscle cells, a variety of blood cells, and a variety of cells of an 
organism. Transposition is one of nature's methods for introducing new genetic material into 
chromosomes. Naturally, the process operates over evolutionary periods of time, i.e. hundreds 

20 of thousands to millions of years per transfer. As a result there are few, if any, defenses against 
transposition - the cost of defense is far too high for such infrequent benefit Another natural 
method of introducing genes into chromosomes is via viruses; which in contrast to transposons 
pose constant threats to cells and therefore elicit host responses. As a consequence, transposons 
have been used for two general purposes in bacteria, yeast and lower animals such as insects and 

25 nematodes, gene transfer and gene discovery via insertional mutagenesis 56,162 " 164 . The first 
application is as a non-viral transfer vector to direct the incorporation of specific nucleotide 



WO 2004/065581 PCT/US2004/000977 

-25- 

sequences into chromosomes. These sequences generally are fully able to express the genetic 
information carried in the transposon and thus represent a useful, non-viral method to achieve 
transfer of active genes. The second application of transposons is as an insertional mutagen 165 " 
170 . In this case, a gene's function is interrupted by the integration of a transposon into the 
5 transcriptional unit. The power of insertional mutagens is dependent on the degree of the 
randomness of their integrations in genomic DNA. Transposons appear to integrate randomly 160 . 

For these applications, a source of transposase is required. Historically, transposase has 
been supplied via its gene that is transcribed to mRNA that then is translated into the transposase 
polypeptide. The transposase proteins then bind to the inverted terminal repeats for the cut-and- 

10 paste events that comprise transposition. In some instances, e.g., germ cells, mRNA encoding 
the transposase protein can be injected. For some systems injection of the transposase protein 
itself is done. By delivering transposase mRNA or protein, one can avoid the possible 
integration of the transposase gene, which in some circumstances is not desired. For nearly all 
cases, the source of transposons is a plasmid that contains the transposon. Plasmids are circular 

15 DNA molecules that are capable of being amplified to high numbers in appropriate bacteria such 
as Escherichia coli. For most purposes, up to 10 12 plasmids can be routinely harvested and 
purified from each ml of bacterial culture 173 . Alternatively, viral vectors may be used to deliver 
a transposase, transposon, or both a transposase and a transposon. 
The Sleeping Beauty Transposonal Vector 

20 A particularly useful vector is a tranposase/transposon system for introducing nucleic 

acid sequences into the DNA of a cell 9 as set forth in U.S. Patent No. 6,489,458 and U.S. Patent 
Serial Nos. 09/191,572 entitled "Nucleic Acid Transfer Vector For The Introduction Of Nucleic 
Acid Into The DNA Of A Cell"; Serial No. 09/569,257 entitled "Vector-Mediated Delivery Of 
Integrating Transposon Sequences"; Serial No. 10/128,998 entitled "Transposon System For 

25 Gene Delivery In Vertebrates"; and Serial No. 10/128,998 "Composition For Delivery Of 
Compounds To Cells"; see also PCT application WO 99/25817, entitled "Nucleic Acid Transfer 
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Vector For The Introduction Of Nucleic Acid Into The DNA of a Cell", PCT Application WO 
00/68399 "Vector-mediated delivery of integrating transposon sequences"; see also: Ivies, Z., P. 

B. Hackett, R. H. Plasterk and Z. Izsvak (1997). "Molecular reconstruction of Sleeping Beauty, a 
Tel -like transposon from fish, and its transposition in human cells." Cell 91: 501-510 ; Izsvak, 

5 Z., Z. Ivies and R. H. Plasterk (2000). ''Sleeping Beauty, a wide host range transposon vector for 
genetic transformation in vertebrates." J. Mol. Biol. 302: 93-10 ; Karsi A., B. Moav, P.B. 
Hackett and Z. Liu (2001) "Effects of insert size on transposition efficiency of the Sleeping 
Beauty transposon in mouse cells." Mar. Biotechnol. 3: 241-245; Cui, Z., A. M. Geurts, G. Liu, 

C. D. Kaufman and P. B. Hackett (2002); "Structure-function analysis of the inverted terminal 
10 repeats of the sleeping beauty transposon." J. Mol. Biol. 318: 1221-1235.; Harris, J. W., D. D. 

Strong, M. Amoui, D. J. Baylink and K. H. Lau (2002). "Construction of a Tel -like transposon 

Sleeping Seaway-based gene transfer plasmid vector for generation of stable transgenic 

mammalian cell clones." Anal. Biochem. 310: 15-26 ;Izsvak, Z. s D. Khare, J. Behlke, U. 

Heinemann, R. H. Plasterk and Z. Ivies (2002). "Involvement of a Afunctional, paired-like 
15 DNA-binding domain and a transpositional enhancer in Sleeping Beauty transposition." J. Biol. 

Chem.277: 34581-34588; Vigdal, T. J., C. D. Kaufman, Z. Izsvak, D. F. Voytas and Z. Ivies 

(2002). "Common physical properties of DNA affecting target site selection of Sleeping Beauty 

and other Tcl/mariner transposable elements." J. Mol. Biol.323: 441-452; 8. Zayed, H. 5 Z. 

Izsvak, D. Khare, U. Heinemann and Z. Ivies (2003). "The DNA bending protein HMGB1 is a 
20 cellular cofactor of Sleeping Beauty transposition." Nucleic Acids Res. 31: 2313-2322; Geurts, 

A.M., Y. Yang, KJ. Clark, Z. Cui, AJ. Dupuy, D.A. Largaespada and P.B. Hackett (2003). 

"Gene transfer into genomes of human cells by the Sleeping Beauty transposon system." Mol. 

Therap. 8: 108-117; Ivies Z., CD. Kaufman, H. Zayed, C. Miskey, O. Walisko, Z. Izsvak Z. 

(2004). "The Sleeping Beauty transposable element: evolution, regulation and genetic 
25 applications." Curr. Issues Mol. Biol. 6: 43-55.; 11. Yant, S.R., and M.A. Kay (2003). 

"Nonhomologous-end-joining factors regulate DNA repair fidelity during Sleeping Beauty 
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element transposition in mammalian cells." Mol. Cell. Biol. 23: 8505-8518. ; P.B. Hackett, and 
R.S. Mclvor. (2003). "Counterselection and co-delivery of transposon and transposase functions 
for the study of mediated transposition in cultured mammalian cells." Som. Cell Mol. Genet, (in 
press); Izsvak, Z., S.E., Stuwe, D. Fiedler, A. Katzer, P.A. Jeggo and Z. Ivies (2004). Healing the 
5 wounds inflicted by Sleeping Beauty transposon by double-strand break repair in mammalian 
somatic cells. Mol. Cell, in press; Luo, G., Z. Ivies, Z. Izsvak and A. Bradley (1998). 
"Chromosomal transposition of a Tel /mariner-like element in mouse embryonic stem cells." 
Proc. Natl. Acad. Sci. USA 95: 10769-10773. ; Dupuy, A. J., S. Fritz and D. A. Largaespada 
(2001). "Transposition and gene disruption in the male germline of the mouse." Genesis 30: 82- 

10 88; Fischer, S. E., E. Wienholds and R. H. Plasterk (2001). "Regulated transposition of a fish 
transposon in the mouse germ line." Proc. Natl. Acad. Sci. USA 98: 6759-6764. ; Horie, K. ? A. 
Kuroiwa, M. Ikawa, M. Okabe, G. Kondoh, Y. Matsuda and J.Takeda (2001). "Efficient 
chromosomal transposition of a Tcl/mariner- like transposon Sleeping Beauty in mice." Proc. 
Natl. Acad. Sci. USA 98: 9191-9196 ; Dupuy, A. J., K. Clark, C. M. Carlson, S. Fritz, A. E. 

15 Davidson, K. M. Markley, K. Finley, C. F. Fletcher, S. C. Ekker, P. B. Hackett, S. Horn and D. 
A. Largaespada (2002). "Mammalian germ-line transgenesis by transposition." Proc. Natl. Acad. 
Sci. USA 99: 4495-4499.; Roberg-Perez, K., CM. Carlson and D.A. Largaespada (2003). 
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81.; Carlson, C ? A. 3 Dupuy, S. Fritz, K. Roberg-Perez, C.F. Fletcher and D.A. Largaespada 
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Plasterk (2001). "Regulated transposition of a fish transposon in the mouse germ line." Proc. 
Natl. Acad.Sci. USA 98: 6759-6764; Horie, K., A. Kuroiwa, M. Ikawa, M. Okabe, G. Kondoh, 
Y. Matsuda and J. Takeda (2001). "Efficient chromosomal transposition of a Tel /mariner- like 
transposon Sleeping Beauty in mice." Proc. Natl. Acad. Sci. USA 98: 9191- 9196; Dupuy, A. I, 

15 K. Clark, C. M. Carlson, S. Fritz, A. E. Davidson, K. M. Markley, K.Finley, C. F. Fletcher, S. C. 
Ekker, P. B. Hackett, S. Horn and D. A. Largaespada (2002). "Mammalian germ-line 
transgenesis by transposition.'Troc. Natl. Acad. Sci. USA 99: 4495-4499; Roberg-Perez, K., 
CM. Carlson and D.A. Largaespada (2003). 'INITIO: a database of Sleeping beauty transposon 
insertions in mice. 55 Nucl. Acids Res. 31: 78-81; Carlson, C, A., Dupuy, S. Fritz, K. Roberg- 

20 Perez, C.F. Fletcher and D.A. Largaespada (2003). "Transposon mutagenesis of the mouse 
germline/'Genetics 165:243-256; Horie, K„ K. Yusa, K. Yae, j. Odajima, S.E.J. Fischer, V.W. 
Keng, T. Hayakawa,S. Mizuno, G. Kondoh, T. Ijiri, Y. Matsuda, R.H.A. Plasterk and J. Takeda 
(2003). "Characterization of Sleeping Beauty transposition and its application to genetic 
screening in mice." Mol. Cell. Biol. 23: 9189-9207; Yant, S. R., L. Meuse, W. Chiu, Z. Ivies, Z. 

25 Izsvak and M. A. Kay (2000). "Somatic integration and long-term transgene expression in 
normal and haemophilic mice using a DNA transposon system." Nature Genetics 25: 35- 
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41.Montini, E., P. K. Held, M. Noll, N. Morcinek, M. Al-Dhalimy 5 M. Finegold, S. R. Yant, M. 
A. Kay and M. Grompe (2002). "In Vivo Correction of Murine Tyrosinemia Type I by DNA- 
Mediated Transposition." Mol. Therapy 6: 759-769.; 3. Yant, S. R, A. Ehrhardt, J. G. 
Mikkelsen, L. Meuse, T. Pham and M. A. Kay (2002). "Transposition from a gutless adeno- 
5 transposon vector stabilizes transgene expression in vivo." Nature Biotechnol. 20: 999-1005. ; 
Ortiz, S-, Q- Lin, S.R. Yant, D. Keene, M.A. Kay and P.A. Khavari (2003). "Sustainable 
correction of junctional epidermollysis bullosa via transposonmediated nonviral gene transfer." 
Gene Therapy 10: 1099-1104. ; Belur, L., J.L. Frandsen, A. Dupuy, D.H. Xngbar, D.L. 
Largaespada, P.B. Hackett and R.S. Mclvor. "Integration and long-term expression in lung 

10 mediated by the Sleeping Beauty transposon system." Mol. Therapy 8: 501-507. ; Liu, G 3 Z. Cui, 
EX. Aronovich, C.B. Whitley and P.B. Hackett (2003). "Excision of Sleeping Beauty 
transposons: parameters and applications to gene therapy." J. Gene Medicine (in press); 
Mikkelsen, J.G., S.R Yant, L. Meuse, Z. Huang, H. Xu and M.A. Kay (2003) Helper- 
independent Sleeping Beauty transposon-transposase vectors for efficient nonviral gene delivery 

15 and persistent gene expression in vivo. Mol. Therapy 8: 654-665.; Kren, B.T., S.S. Ghosh, C.L. 
Linehan, N. Roy-Chowdhury, P.B. Hackett, J. Roy-Chowdhury and C.J. Steer (2003). 
"Hepatocyte-targeted delivery of Sleeping Beauty mediates efficient gene transfer in vivo:' Gene 
Ther. Mol. Biol. 7: 229- 38.; Izsvak, Z. and Z. Ivies (2004). Sleeping Beauty transposition: 
biology and applications for molecular therapy. Mol. Therap.: in press; Davidson, A.E., D. 

20 Balciunas, D. Mohn, J. Shaffer, S. Hermanson, S. Sivasubbu, M.P. Cliff, P.B. Hackett, and S.C. 
Ekker (2003). "Efficient gene delivery and gene expression in zebrafish using the Sleeping 
Beauty Transposon." Dev. Biol. 263:191-202.; Grabher C, T. Henrich, T. Sasado, A. Arenz, J. 
Wittbrodt and M. Furutani-Seiki (2003). "Transposon-mediated enhancer trapping in medaka." 
Gene. 322: 57-66.; Clark, KJ. A.M. Geurts, J. Bell, and P.B. Hackett (2004). "Transposon 

25 vectors for gene-trap insertional mutagenesis in vertebrates" (submitted); Hackett, P.B., S.C. 
Ekker and J.J. Essner (2003). Applications of transposable elements in fish for transgenesis and 
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flinctional genomics. In Fish Developmental Biology and Genetics. (Zhiyuan Gong and Vladimir 
Korzh,eds.) (in press); Li, Z. H. 5 D. P. Liu, J. Wang, Z. C. Guo, W. X. Yin and C. C. Liang 
(1998). "Inversion and transposition of Tel transposon of C. elegans in mammalian cells." 
' Somat. Cell. Mol. Genet. 24: 363-369.; Plasterk, R.H.A., Z. Izsvak and Z. Ivies (1999). 
5 "Resident aliens: the Tcl/mariner superfamily of transposable elements." Trends Genet. 15: 326- 
332.; Richardson, P., C. Thoma, B. T, Kren and C. J. Steer (2002). "Strategies for hepatic gene 
correction." J Drug Target 10: 133-141; Richardson, P. D., L. B. Augustin, B. T. Kren and C. J. 
Steer (2002). "Gene repair and transposon-mediated gene therapy." Stem Cells 20: 105- 
118.;Kren, B.T., NR. Chowdhury, J.R. Chowdhury and CJ. Steer (2002). "Gene therapy as an 
10 alternative to liver transplantation." Liver Transpl. 8:1089-1108; Miskey, C, Z. Izsvak, R.H.A. 
Plasterk and Z. Ivies (2003). "The Frog Prince: a reconstructed transposon from Rana pipiens 
with high transpositional activity in vertehrate cells." Nucleic Acids Res. 31: 6873-6881; each of 
which is hereby incorporated by reference herein, including all nucleic acid sequences and 
references thereto. 

15 The Sleeping Beauty system is the first transposon system designed to integrate foreign 

DNA (called transgenic DNA) efficiently into vertebrate chromosomes. The Sleeping Beauty 
system involves the cooperation of the Sleeping Beauty transposases with inverted terminal 
repeats on the transposon. The repeats are capable of specifically binding to the transposase and 
are required for the transfection process to occur. Specifically bind is a term that refers to a 

20 binding event that involves a stereotyped interaction between two components and is a term of 
art that distinguishes such events from non-specific, or background binding. For example, CTCF 
specific binding to various elements has been documented, e.g., reference 41. 

In an embodiment of the gene transfer system herein, the Sleeping Beauty (SB) protein 
can be introduced into the cell as a protein or as nucleic acid encoding the protein. In one 

25 embodiment the nucleic acid encoding the protein is RNA and in another, the nucleic acid is 
DNA. Further, nucleic acid encoding the Sleeping Beauty protein can be incorporated into a cell 
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through a viral vector, anionic or cationic lipid, or other standard transfection mechanisms 
including electroporation, particle bombardment or microinjection used for eukaryotic cells. 
Following introduction of nucleic acid encoding Sleeping Beauty, the nucleic acid fragment of 
this invention can be introduced into the same cell. 

Similarly, the nucleic acid fragment can be introduced into the cell as a linear fragment or 
as a circularized fragment, preferably as a plasmid or as recombinant viral DNA. Preferably the 
nucleic acid sequence comprises at least a portion of an open reading frame to produce an 
amino-acid containing product. In a preferred embodiment, the nucleic acid sequence encodes at 
least one protein and includes at least one promoter selected to direct expression of the open 
reading frame or coding region of the nucleic acid sequence. The protein encoded by the nucleic 
acid sequence can be any of a variety of recombinant proteins new or known in the art. In one 
embodiment the protein encoded by the nucleic acid sequence is a marker protein such as GFP, 
chloramphenicol acetyltransferase (CAT), beta-galactosidase (lacZ), and luciferase (LUC). In 
another embodiment, the protein encoded by the nucleic acid is a growth hormone, for example 
to promote growth in a transgenic animal, or insulin-like growth factors (IGFs). 
Transposonal and Viral Vectors 

There are some distinct differences between the use of viruses and transposons for gene 
transfection. One difference is that a transposon works in conjunction with a transposase. A 
transposase is a protein that controls removal and insertion of the transposon DNA. The 
presence of a transposon, by itself, will not cause transfection of a gene unless the transposase is 
present in the same cell as the transposon. Further, the transposase must be capable of binding to 
the transposon and cooperating with it to accomplish transfection. 

Some aspects of successful gene therapy are to 1) find the appropriate gene for transfer, 
2) find a method for delivery of the gene to the tissues that are affected or to other tissues that 
can provide the necessary activity from afar, and 3) achieve long-term expression of the 
transgene so that repeated deliveries are not required. 4) Above all, the gene must be introduced 
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and expressed in a reliably safe manner. Viruses invade cells and in some cases deliver their 
genetic material to chromosomes. But, because there are so many viruses to which we are 
exposed, animals have developed elaborate defensive mechanisms to protect against most viral 
infections. As a result, genes delivered by viruses often are silenced soon after delivery, 
necessitating multiple therapeutic infections. But, multiple deliveries of viruses commonly elicit 
immunological responses that can be harmful and even lethal to the patient. 

One conventional approach to addressing the silencing of virally delivered genes is to use 
insulator elements. Indeed, the elements have been used to enhance the expression of the virally 
delivered genes in some circumstances. A transposon, however, avoids the silencing of virally- 
delivered genes, and can be maintained and expressed over very long periods of time, and can be 
passed from one cell generation to the next without loss of expression. A transposon, therefore, 
would not conventionally be expected to be useful in combination with an insulator. As 
described herein, however, an insulator may advantageously be used to silence host genes, even 
with a transposon. Indeed, one advantage of the transposon system is that it avoids the use of 
viruses. Genes delivered into the livers of mice using, for example, the Sleeping Beauty 
Transposon System can partially restore deficiencies of several enzymes (Yant et al. 5 2000; PCT 
WO 01/30965). 
Adm inistration 

One aspect of employing a transposon system in gene therapy is devising a mechanism 
for its delivery. One common method is the hydrodynamic delivery wherein a relatively large 
volume of transgenic DNA is injected into the circulatory system (the tail vein in mice) under 
high pressure - most of this DNA winds up in cells of the liver 171 . Another method is to use 
negatively charged liposomes containing galactocerebroside, or complexed with lactosylated 
polyethyleneimine (PEI), which have been effective in delivering nucleic acids into hepatoma 
cells, primary hepatocytes and liver cells in living mice 172 . 

The delivery of transposons to any tissue in the body is contemplated, including cells 
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found in blood, liver, lung, pancreas, muscle, eye, brain, nervous system, organs, dermis, 
epidermis, cardiac, and vasculature. Delivery may be by, e.g., direct injection into or near the 
desired tissue, complexation with molecules that preferentially or specifically bind to a target in 
the desired tissue, control release, oral, intramuscular, and other delivery systems that are known 
5 to those skilled in these arts. Another embodiment for delivery is electroporation, e.g., 
electroporation of cells in the blood using an electroporator. Cells may be microinjected or 
electroporated in vitro or in vivo; detailed materials and methods for such processes are provided 
in U.S. Patent Serial No. . 60/513,052, filed October 21, 2003, entitled "Materials and Methods 
of Using Transposons Encoding RNAi", which is incorporated by reference herein in its entirety. 

10 RNAi is described in greater detail elsewhere, e.g., see below, and in Yin and Wan (J.Q. Yin a 
nd Y. Wan. International Journal Of Molecular Medicine, (2002), 10: 355-365). As categorized 
by Yin and Wan, RNAi includes long double stranded RNAs, long single stranded sense RNA, 
single stranded RNAs that form duplexes, short double stranded RNAs, and short antisense 
RNAs. RNAi is the subject of U.S. Patent and PCT applications, e.g., certain of the following: 

15 US20030125281; US20030130186; US20030124513; US200301 19017; US20030144239; 
US20030166282; US20030148519; US20030157691; US20030153519; US20030139363; 
US20030166512; US20030036056; WO03056022; WO03020931; WO03008573; WO0244321; 
WO03070895; WO03070193; WO03070750; WO03070918; WO03070914; WO03066650; 
WO03068797; WO02097114; W09946372; WO0060115; W09519788; WO9206988; and US 

20 6,562,570 US 5,985,661. US 5,750,380 US 5,750,380 US 5,272,262 US 5,149,796; US 
5,144,019; and US 5,110,802. Use of RNAi and other materials and methods as described in 
these publications is contemplated in combinations with the embodiments described elsewhere 
herein. 

Examples of delivery of certain embodiments herein include via injection, including 
25 intravenously, intramuscularly, or subcutaneously, and in a pharmaceutically acceptable carriers, 
e.g., in solution and sterile vehicles, such as physiological buffers {e.g., saline solution or glucose 
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serum). The embodiments may also be administered orally or rectally, when they are combined 
with pharmaceutically acceptable solid or liquid excipients. Embodiments can also be 
administered externally, for example, in the form of an aerosol with a suitable vehicle suitable 
for this mode of administration, for example, nasally. Further, delivery through a catheter or 
5 other surgical tubing is possible. Alternative routes include tablets, capsules, and the like, 
nebulizers for liquid formulations, and inhalers for lyophilized or aerosolized agents. 

Presently known methods for delivering molecules in vivo and in vitro, especially small 
molecules, nucleic acids or polypeptides, may be used for the embodiments. Such methods 
include microspheres, liposomes, other microparticle vehicles or controlled release formulations 
10 placed in certain tissues, including blood. Examples of controlled release carriers include 
semipermeable polymer matrices in the form of shaped articles, e.g., suppositories, or 
microcapsules and U.S. Patents Nos. 5,626,877; 5,891,108; 5,972,027; 6,041,252; 6,071,305, 
6,074,673; 6,083,996; 6,086,582; 6,086,912; 6,110,498; 6,126,919; 6,132,765; 6,136,295; 
6,142,939; 6,235,312; 6,235,313; 6,245,349; 6,251,079; 6,283,947; 6,283,949; 6,287,792; 
15 6,296,621; 6,309,370; 6,309,375; 6,309,380; 6,309,410; 6,317,629; 6,346,272; 6,350,780; 
6,379,382; 6,387,124; 6,387,397 and 6,296,832. Moreover, formulations for administration can 
include transdermal patches, ointments, lotions, creams, gels, drops, suppositories, sprays, 
liquids, and powders. 
Activation-safe transposons 
20 In another embodiment, an Activation-safe transposon includes a Sleeping Beauty 

Transposon System having a gene that can direct the demise of a cell in which either the 
therapeutic gene or the transposase gene is expressed (see Figures 3 and 4). Such a gene is 
colloquially referred to as a "suicide gene (SG)." The SG is expressed whenever the transgene is 
expressed. The term suicide sequence nucleic acid is used herein to refer to nucleic acid 
25 fragments that encode triggerable elements for cellular destruction. Embodiments herein that 
refer to a suicide gene will therefore be understood as also being applicable to a suicide sequence 
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nucleic acid. Should an adverse event occur in a patient that is due to insertion of the therapeutic 
sleeping beauty vector, a substrate for the SG can be administered to the patient that will cause 
the product of the SG gene to kill the cells in which it is expressed. This procedure will undo the 
benefits of the therapeutic gene and can be used whenever the problems of the therapeutic 

5 construct are greater than the benefits. 

For example, the Thymidine Kinase (TK) gene of the Human Herpes Simplex Virus is 
placed behind a marker gene (e.g., neo or Green Fluorescent Protein) or a therapeutic gene (e.g., 
glucuronidase B, GUSB) through an Internal Ribosome Entry Site (IRES) derived from the 
Encephalomyocarditis Virus (EMCV). As a result, two genes are co-transcribed as a single 

10 mRNA from the enhancer-promoter cassette driving the therapeutic gene. The EMCV IRES 
sequence allows expression of the TK gene, generally at a lower rate. The EMCV IRES element 
is an often-used sequence for this purpose (e.g., Fahrenkrug et al. 5 1999). The HSV TK protein 
will not affect cells unless the drug gangcyclovir is added, at which time the HSV TK gene 
converts the drug into a poisonous material that kills the cell in which the HSV TK gene is 

1 5 expressed. The HSV TK gene has been used for this purpose (Borelli et al., 1988) and has been 
proposed for gene therapy as a killing agent for cancer cells (e.g., Anderson, 1998). The suicide 
gene can be co-expressed with the transgene via a common promoter, with the IRES allowing 
translation of the suicide gene. Alternatively, the suicide gene can be independently transcribed 
from a separate promoter that can be independently activated. These two examples are shown in 

20 Figure3. 

Other suicide sequence nucleic acids have been described in the patpetner literature, as 
in, for example: EP 0 694 070 Recombinant alphavirus vectors; WO 95/07994 Recombinant 
alphavirus vectors; EP 0 777 739 Genetic therapy of vascular diseases with a cell-specific active 
substance which is dependent on the cell cycle; EP 0 804 601 Genetic therapy of tumors with an 
25 endothelium cell-specific active substance which is dependent on the cell cycle; and EP 0 807 
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183 Genetic therapy of diseases caused by the immune system, said therapy using a cell-specific 
active substance which is dependent on the cell cycle. 

The SG can also be attached to the Sleeping Beauty transposase gene to preclude its 
activity after initial transposition has occurred. This process relieves concerns that a transposase 

5 gene integrated into chromosomes of host cells could give a low level transposase activity that 
might lead to transformation of the cell in deleterious ways. Suicide genes include, but are not 
limited to, HSV TK, yeast or bacterial cytosine deaminase, microbial purine nucleoside 
phosphorylase, or any other gene encoding an enzyme that activates an inert substance to a 
product that is toxic in a cell in which it is expressed. 

10 Embodiments include a transposon containing a genetic sequence (e.g., insulator 

sequence). Such elements/sequences may inhibit or block the activities of genetic signals 
outside the transposon from acting on genetic sequences inside the transposon. Other 
embodiments include a transposon containing genetic sequences that can direct the synthesis of a 
product that can kill a cell when activated by a substrate molecule. Other embodiments include a 

15 suicide gene attached to SB transposase gene. Other embodiments include a combination of a 
border element, insulator sequence, a suicide gene and their use on a transposon, a transposase, 
or both. Variations of the embodiments include use of the transposon with and without 
transposase (either as a gene, mRNA or protein), and use of transposase in trans (separate 
plasmid as transposon) and cis (on the same plasmid) configurations. 

20 As an example, insulator elements may include, but are not limited to natural elements 

(e.g., HS4, chicken lysozyme A elements, Drosophila scs/scs 5 ) and synthetic derivatives thereof. 
A natural element is an element found in nature but a synthetic element is not found in nature. 
Both natural and synthetic elements may be synthesized using, e.g., a machine. Border elements 
may include, but are not limited to natural elements (e.g., HS4, chicken lysozyme A elements, 

25 Drosophila scs/scs') and synthetic derivatives thereof. 
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Activation-safe transgenic vectors, e.g., transposons and/or viruses, overcome problems 
wherein randomly inserted genetic material may deleteriously affect expression of genes residing 
in chromosomes. Viral vectors or non-viral vectors such as Sleeping Beauty transposons are 
used. Set forth herein are some examples of applications to overcome potential adverse effects 
5 of transgenic DNA (Milot et al., 1996). A notable application is in human gene therapy, where 
adverse events due to unintended activation of genes that caused problems to the patients. 
Activation-safe gene therapy vectors that contain insulator elements generally do not induce 
inappropriate expression of chromosomal genes outside the vector itself. The fail-safe version of 
gene therapy vectors, e.g., Figure 3, allow clinicians treating a patient "to pull the plug" in a 

10 patient in which the gene therapy vector causes unintended adverse effects. 
Additional examples, methods, and uses 

Certain embodiments described below involve the Sleeping Beauty transposon system 
that will either keep enhancers within the transposon from activating genetic sequences outside 
of the transposon or allow investigators/physicians to kill those cells that harbor a transposon 

15 that has activated a nearby gene(s). Some embodiments allow transposase activity to be 
controlled and are sometimes referred to herein as Activation-Safe Transposons and Fail-Safe 
Transposons. 

Some embodiments include a transposon that includes an insulator sequence. Such a 
transposon may further include an exogenous DNA sequence. An exogenous sequence is a 

20 sequence that is intended to be introduced into an animal a cell, a nucleus, or into another DNA 
sequence. An exogenous sequence may be natural or synthetic and may be naturally present in 
the animal that receives the exogenous sequence or it may be non-native to the animal. 
Examples of an exogenous sequence include a promoter, an enhancer, a marker sequence, a 
sequence encoding a therapeutic protein or a catalytic RNA, and insulator/border sequences. 

25 Further examples include labeled nucleic acid sequences and nucleic acid fragments that 

encode markers. Labeled nucleic acid sequences are detectable, e.g., by fluorescence, nuclear 
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magnetic resonance imaging, X-ray, microscopy, transmission electron microscopy, light 
microscopy, histological examination, and ELISA. A marker is a factor that allows for 
expression of a nucleic acid sequence to be detected, e.g., a green fluorescent protein, florescent 
molecules, enzymes, biotin, avidin, antibodies, and skin cell pigments. 
5 Other examples of applications include the delivery of marker nucleic acid sequences to 

cells, including cells in plants and animals (including humans) as well as cells that are in cell 
culture, including organ or tissue culture, and including prokaryotic and eukaryotic cells. For 
example, a marker sequence of green fluorescent protein associated with at least one border 
elements/insulator sequence may be delivered to a eukaryotic cell in a petri dish. 
10 A transposon having an exogenous sequence may also contain an enhancer and a 

promoter. Embodiments also include a transposon having an exogenous gene that encodes a 
suicide sequence nucleic acid, e.g., a suicide gene, or other sequence that causes the demise of a 
cell or a cluster of cells, as described below. The suicide sequence may also be used in 
combination with the insulator element. Embodiments set forth herein can be used in 
15 combination with both viral (e.g. lentiviruses, see, e.g., US Patent No. 6,013,516) and non-viral 
vectors (e.g., transposons, see, e.g., US Patent No. 6,489,458). 

One embodiment of the activation-safe insulator transposon includes a Sleeping Beauty 
Transposon System with at least one, e.g., two, insulator elements, see Figure 2. The insulator 
elements preferably are just inside the Inverted Terminal Repeats of an Sleeping Beauty 
20 transposon. The insulator elements act to block activation of genes proximal to the insertion site 
of the gene therapy transposon. Any of several types of insulator elements can be used. The 
structure and characteristics of insulator elements are set forth in greater detail in, for example, 
U.S. Patent Nos. 5,731,178; 6,100,448; 6,395,549; 5,610,053, and PCT WO 00/23606, and 
references Chung et al., 1997; Recillas-Targa et al., 1999; and Bell et al., 1999. The inclusion of 
25 insulator elements does not affect the choice of therapeutic gene. The insulator elements may 
have the additional value of limiting "position effects" by neighboring, resident enhancers. 
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Endogenous transcriptional enhancers can affect expression of an inserted gene in the same way 
that the inserted enhancers of a therapeutic gene can affect expression of resident genes. In this 
way the output of the therapeutic gene is more reliable. 

Another example involves transgenic animals used as bioreactors for the manufacture of 
5 biological reagents, proteins, of medical value (Jaenisch, 1988; Dove, 2000). Transgene 
expressed from vectors similar to those used for human gene therapy direct the synthesis of the 
commercially valuable proteins. Just as the health of a human is important, so for animal 
bioreactors the health of the animal (cow, sheep, goat, pig, fish) is important. The activation- 
safe vectors can be used for transgenesis of animals that will result in reliable transgene 

10 expression without collateral damage to the animal because the embodiments designed for 
humans generally work in all vertebrate animals (Hackett and Alvarez, 2000). 

Another example involves the use of transgenic fish that have been developed as a 
commercial food (Niiler, 2000). There are concerns of the effects of these fish, not only in terms 
of their augmented traits, but also in terms of unintended consequences of genetic engineering 

15 (Reichardt, 2000; Muir and Howard, 2002). Fish appear to be the first of several animals that 
will be genetically engineered for one or more particular traits. Activation-safe vectors can be 
used for transgenesis of animals that will result in reliable transgene expression without 
collateral damage to the animal because the embodiments designed for humans work in all 
vertebrate animals. 

20 Another example relates to transgenic animals, which are extensively used for research 

applications. Functional genomics, the area of genomic science that seeks to attribute function to 
newly found genes from genome projects 5 depends on inactivation of genes as well as 
overexpression of genes. Overexpression of genes involves transgenesis (e.g., Camper, 1987). 
Activation-safe vectors can be used for transgenesis of animals that can result in reliable 

25 transgene expression with fewer or no effects on other endogenous genes that would obscure the 
functional significance of the transgene in a model vertebrate animal. 
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Another example relates to transgenic animals THAT have been deployed as sentinels for 
environmental toxins (e.g., Amanuma et al. 2000; Carvan et al., 2000; Winn et al., 2000). 
Transforming a natural animal into a bio-sentinel generally involves transgenesis. Activation- 
safe vectors can be used for making bio-sentinels without effects on other endogenous genes that 
5 are important for natural physiological pathways in the animal that could affect the performance 
and reliability of the bio-sentinel. 

Another example relates to transgenesis in tissue culture to investigate the effects of 
transgenes. Implicit in the analyses is the assumption that the observed effects are due to the 
transgene and its intended activity (e.g., Hackett et al. 1999). Activation-safe transgenic vectors 
10 will give greater confidence in this fundamental assumption. Markers, therapeutic sequences, 
and sequences for producing a product may be used. Constructs for human gene therapy are first 
conducted in cultured cells and then in animals (e.g., Yant et al. 2000, 2002; Montini et al. 
2002). 

Other examples of applications include the delivery of marker nucleic acid sequences to 
15 cells, including cells in plants and animals (including humans) as well as cells that are in cell 
culture, including organ or tissue culture, and including prokaryotic and eukaryotic cells. For 
example, a marker sequence of green fluorescent protein associated with at least one border 
elements/insulator sequence may be delivered to a eukaryotic cell in a petri dish. 
Nucleic Acids 

20 As used herein, the term nucleic acid refers to both RNA and DMA, including cDNA, 

genomic DNA, synthetic (e.g., chemically synthesized) DNA 5 as well as naturally occurring and 
chemically modified nucleic acids, e.g., synthetic bases or alternative backbones. A nucleic acid 
molecule can be double-stranded or single-stranded (i.e., a sense or an antisense single strand). 
An isolated nucleic acid refers to a nucleic acid that is separated from other nucleic acid bases 

25 that are present in a genome, including nucleic acids that normally flank one or both sides of a 
nucleic acid sequence in a vertebrate genome (e.g., nucleic acids that flank a gene). The term 
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isolated as used herein with respect to nucleic acids also includes non-naturally-occurring 
nucleic acid sequences, since such non-naturally-occurring sequences are not found in nature and 
do not have immediately contiguous sequences in a naturally occurring genome. 

An isolated nucleic acid can be, for example, a DNA molecule, provided at least one of 
5 the nucleic acid sequences normally found flanking that DNA molecule in a naturally-occurring 
genome is removed or absent. Thus, an isolated nucleic acid includes, without limitation, a DNA 
molecule that exists as a separate molecule {e.g., a chemically synthesized nucleic acid, or a 
cDNA or genomic DNA fragment produced by PGR or restriction endonuclease treatment) 
independent of other sequences as well as DNA that is incorporated into a vector, an 
10 autonomously replicating plasmid, a virus (e.g., a retrovirus, lentivirus, adenovirus, or herpes 
virus), or into the genomic DNA of a prokaryote or eukaryote. In addition, an isolated nucleic 
acid can include an engineered nucleic acid such as a DNA molecule that is part of a hybrid or 
fusion nucleic acid. A nucleic acid existing among hundreds to millions of other nucleic acids 
within, for example, cDNA libraries or genomic libraries, or gel slices containing a genomic 
15 DNA restriction digest, is not considered an isolated nucleic acid because such sources do not 
indicate a role for the nucleic acid or its uses. Indeed, there is often no knowledge of the 
sequences present in such sources until their presence is hypothesized as a result of using 
hindsight in light of a new sequence. 

The identity of a protein or nucleic acid sequence is frequently established based on a 
20 sequence alignment of the DNA, RNA, or amino acids. Multiple alignments of such sequences 
are important tools in studying biomolecules. The basic information they provide is 
identification of conserved sequence regions. This is very useful in designing experiments to test 
and modify the function of specific proteins, in predicting the function and structure of proteins, 
and in identifying new members of protein families. Sequences can be aligned across their entire 
25 length (global alignment) or only in certain regions (local alignment). This is true for pairwise 
and multiple alignments. Global alignments with respect to polynucleic acids or polypeptides 
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usually need to use gaps (representing insertions/deletions) while local alignments can usually 
avoid them by aligning regions between gaps. In a sequence alignment, letters arranged over one 
another are called matched. If two matched letters are equal, the match is called an identity 
otherwise the match is called a substitution or mismatch. An insertion or deletion (indel) is one 
5 or more letters aligned against a gap (-) and is considered the same as a mismatch for percent 
identity purposes (Waterman, M.S. 1995). In some cases a determination of the percent identity 
of a sequence relative to another sequence may be required. In such cases, the percent identity is 
measured in terms of the number of residues that are compared, as is customary in these arts. 
Method to Identify Insulator/Enhancer-Blocking Sites for Use in Gene Therapy Vectors to Block 

10 Activation of Resident Chromosomal Genes. 

In this application, an insulator element is defined as a DNA sequence that will block 
transcriptional regulatory sequences in a transgenic construct from acting on neighboring genes 
after the integration of the transgenic construct into a chromosome. A transgenic construct has 
an exogenous nucleic acid that for introduction into a host. 

15 Herein is described a method for locating elements for testing, as described above. 

Persons of ordinary skill that follow these procedures will be able to rapidly identify a large 
number of insulator elements. Such elements, in combination with the many insulator elements 
already described in detail, provide a person of ordinary skill in the art a useful variety of 
insulator elements that may be used to practice the full scope of inventions taught herein. 

20 As described above, the 1.2 kb DNA sequence from the 5 ? end of the chicken D-globin 

gene (5' HS4) has been described as an insulator 23 . The activity of the full-length HS4 insulator 
is found within a 250 bp "core" element 22 . There are two independent functions of the HS4 
insulator- the blocking of enhancer activation and the blocking of chromosomal position 
effects 120 . The enhancer blocking function of the HS4 insulator requires CTCF-binding protein 8 . 

25 The binding sites for CTCF-binding protein in the chicken p -globin loci are conserved in both 
mice and humans 4 ' 1 . The prominent structures of the human (3 -globin CTCF-enhancer-blocking 
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motifs with respect to a transcriptional unit on a chromosome are shown in Fig. 5. There are two 
features to note to the 5'- and 3' insulators (5-Ins and 3'-Ins respectively in the figure, labeled 
500 and 510, respectively); first the 3' sequence is an inverted form of the 5' sequence; second, 
the sequences vary slightly. The transcriptional unit 520 is an exogenous nucleic acid sequence, 
5 e.g., a marker gene or therapeutic gene, operably linked to an enhancer and/or promoter. 

Certain criteria are indicated herein. A person of ordinary skill in these arts, after 
reviewing these criteria, will be able to alter them as needed to suit a particular circumstance 
tailored to a particular research project. The indicated criteria for insulator elements are the 
following: 

10 1) At least 15 basepairs of the 16 basepair human 5 5 - HS4-CTCF-binding site and 15 

basepairs of the 16 basepair human 3 5 - HS1 -CTCF-binding site or the mouse 3 ? -HSl -CTCF- 
binding site must match in a pair of flanking insulators around a therapeutic gene. The mouse 
3 5 -CTCF-binding site works as well or better than the human sequence 41 and is therefore 
included in the searching process. These sequences can be identified by using NIH BLAST 

15 algorithms (see Table 1). The sequences can be prioritized by on the basis of matching of 5 
basepairs on either side of the human HS4-CTCF binding sites. Over 100 exact matches of the 
CTCF-binding sites found near the human and or mouse JJ-globin loci exist in the human 
genome, and there are likely many more functional CTCF-binding sites that are near matches 
since the CTCF-binding sites from the chicken, mouse, and human B-globin sites have some 

20 sequence flexibility while maintaining function across species. The human genomic sequences 
that are predicted to contain CTCF-binding sites can be placed into test vectors by themselves or 
in conjunction with other CTCF- or other insulator-binding sites to test for the level of insulation 
provided. 

2) The motif binds to CTCF or equivalent polypeptide. The test for this function is 
25 mobility shift assay similar to that employed to define transposase activity, and described in 
detail in the literature 159 ' 160 . 
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3) The motif has functional activity in blocking enhancer activation as defined by 
inhibiting enhancer activation of a promoter that is separated from an enhancer by the insulator 
element. This test is described in the following section. 

5 Table I: Examples of Human sequences Matching CTCF-binding sites 



MATCHES TO THE HUMAN 5' HS5 CTCF-binding site (NG_00007.3) 



10 



15 



GeneBank # Chromosome 

NG 000007.3 Chrm 11 



AC002040 



AC068492 



Chrm 16 



Chrm 2 



AL159163.40 Chrm. 6 



20 AP003049.2 Chrm. 11 



DNA sequence 
ttatga CCACTAGAGGGAAGAA gatacc SEQ ID NO : 1 

I I I M 1 1 1 1 1 1 1 M ii I 

gtcaag CCACTAGAGGGAAGAA aacttt SEQ ID NO : 2 

I I II I 1 1 Ml I I 1 1 1 1 I I 

ccccgc CCACTAGAGGGAAGAA aaaaaa SEQ ID NO : 3 



tcatga CCACTAGAGGGCAGAA gagaaa SEQ ID NO : 4 
I I I I I I 1 I 1 ] 1 1 1 1 1 1 1 1 1 1 II 

tgatga CT AC T AGAG GGAAGAA gaaggg SEQ ID NO : 5 



MATCHES TO THE HUMAN 3' HS1 CTCF-BINDING SITE (X54282). 



25 



30 



GeneBank# 

X54282 
AC067801 

AC100821 



Chromosome 

Chrm. 11 
Chrm . 1 8 

Chrm . 8 



DNA sequence 

aactac TTCTGACCCCTAGTGG tgtcca 

II IIIMII III III III 

gtgtaa TTCTGACCCCTAGTGG ctgagg 

ll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l I 

tcaaac TTCTGACCCCTAGTGA tccacc 



SEQ ID NO: 6 
SEQ ID NO: 7 

SEQ ID NO: 8 



35 AC135950 



BX649553 



Chrm. 16 gfcatac TTCTGACCCCTAGTAG gataaa SEQ ID NO : 9 



Chrm, X 



I I ' Ml I I 

ggatgc TTCTGACCCCTAGTGT ccaaaa SEQ ID NO: 10 



40 
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MATCHES TO THE MOUSE V HS1 CTCF-BINDING SITE (AF133300.2) 
GeneBank# Chromosome DNA sequence 

AF133300.2 Chrm. 7 (mus) tgcata CCAGTAGGGGGCAGAA gtgttc SEQ ID NO: 11 

I I llilllllllllllll mi I 
AC073065.6 Chrm. 2 tcactg CCAGTAGGGGGCAGAA gtgtac SEQ ID NO : 12 



AL109955 



NG 000002.1 



AC123023 .4 



Chrm . 2 0 



Chrm. 2 2 



Chrm . 3 



llilllllllllllll III 
atgcag CCAGTAGGGGGCAGAA gtgggg SEQ ID NO: 13 

I llilllllllllllll I I I 
gagaat CCAGTAGGGGGCAGAA gagacc SEQ ID NO: 14 

I llilllllllllllll I I 
gctttg CCAGTAGGGGGCAGAA gagggt SEQ ID NO: 15 



NOTE: flanking sequences in lower case; reference CTCF-binding sites shown in bold upper 



case: 



20 



CCACTAGAGGGAAGAA is SEQ ID NO: 16; TTCTGACCCCTAGTGG is SEQ ID NO: 17; 
CCAGTAGGGGGCAGAA is SEQ ID NO: 1 8. 



Testing of insulator elements in transposon 

Persons of ordinary skill in these arts may use this protocol to determine if a nucleic acid 
sequence is an insulator element. Putative insulator elements are added immediately inside both 

25 ends of a transposon vector to keep the transgenic enhancers from activating neighboring genes 
using standard recombinant DNA techniques that have been used in the past for scs/scs ' and A- 
element insulators, and which are described in detail in the literature 19 ' 20 . The assay involves 
measuring expression from the luciferase (ffLUC) gene 5 under the direction of the CMV minimal 
promoter, as a result of enhancer activity from within a transposon vector with and without 

30 border elements. 

The strategy is shown in Fig. 6, which represents the features of a test plasmid that will 
be transfected into HeLa cells. Transposon plasmids 600 9 700 contain the firefly luc gene 610 
with a minimal CMV promoter 620 outside a Sleeping Beauty transposon. Putative insulators 
630 are placed as shown, with control 700 not receiving an insulator element. Both transposons 

35 600, 700 contain two species of luciferase reporter gene 610, and a CAGS promoter 640. Hie 
dotted line 658 with an "X" in the top panel indicates that enhancer activity from the transgenic 
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enhancer 650 is blocked from activating the minimal CMV promoter 620 for transposon 600 but 
not for transposon 700, which lacks insulator element 630. The enhancer 650 is able to activate 
the CMV promoter when insulators are lacking, as shown by the solid line-arrow 660 in Figure 
6B. 

5 Following transfection, with either standard sleeping beauty transposons (Fig. 6B) or 

Activation-Safe (A-S) transposons (Fig. 6A) with a Renilla (ren) luciferase reporter gene 161 in 
the transposon, transient expression of firefly (ff) luciferase and Renilla luciferase is measured 
48 hrs after transfection. The ratio of ffLUC/renLUC will indicate the level of activation of the 
firefly luc gene by the CAGGS promoter. If the putative sequence is an insulator, one will not 

10 observe ffLUC expression in the absence of an enhancer for the minimal CMV promoter. As a 
control, one deletes the CAGGS promoter from both the A-S transposons and from the Sleeping 
Beauty transposons. This provides the background level of luc expression that will be subtracted 
in the experiments using complete transposons with and without insulators. In this case one 
should not observe significant expression of either ffLUC or renLUC. One expects that the A-S 

15 transposons will exhibit ffLUC/renLUC ratios much lower than those from Sleeping Beauty 
transposons. 
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CLAIMS 

1 . A transposon comprising 

a transcriptional unit and a plurality of insulator elements, wherein the 
transcriptional unit is flanked by at least one insulator element on each side of the 
5 transcriptional unit, wherein the transcriptional unit comprises an exogenous nucleic acid 

for introduction into a cell. 

2. The transposon of claim 1, wherein the transposon comprises at least two inverted repeat 
sequences. 

10 

3. The transposon of claim 1, wherein the insulator element specifically binds to a CTCF 
protein. 

4. The transposon of claim 1, wherein the insulator element comprises a binding site for a 
15 CTCF protein. 

5. The transposon of claim 1 5 wherein the insulator element comprises a 16-base sequence 
that has at least 80% identity with SEQ ID NO: 1 6. 

20 6. The transposon of claim 1, wherein the insulator element comprises a 16-base sequence 
that has at least 80% identity with SEQ ID NO: 1 7. 

7. The transposon of claim 1, wherein the insulator element comprises a 16-base sequence 
that has at least 80% identity with SEQ ID NO:18. 

25 
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8. The transposon of claim 1, wherein the insulator element comprises at least one of SEQ 
ID NO: 16, SEQ ID NO: 17, OR SEQ ID NO: 18. 

9. The transposon of claim 1, wherein at least one of the insulator elements comprises a 
5 scs/scs 1 element from the D, melanogangster 87 A7 heat shock locus. 

10. The transposon of claim 1, wherein at least one of the insulator elements comprises a 
gypsy moth retrotransposon suQtW) binding region-type insulator, 

10 11. The transposon of claim 1 , wherein at least one of the insulator elements comprises an A- 
element-type insulator. 

12. The transposon of claim 1 , wherein at least one of the insulator elements comprises an A- 
element-type insulator from a chicken lysozyme locus. 

15 

13. The transposon of claim 1, wherein at least one of the insulator elements comprises an 
insulator element from a DHS5 site of a chicken (3-globulin locus. 

14. The transposon of claim 1 wherein at least one of the insulator elements comprises the 
20 mammalian apolipoprotein B gene insulator. 

15. The transposon of claim 1 wherein at least one of the insulator elements comprises a 
human (3-interferon gene insulator. 



25 16. The transposon of claim 1 wherein at least one of the insulator elements comprises an sns 
insulator from sea urchin arylsuflatase gene. 
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17. The transposon of claim 1 wherein at least one of the insulator elements comprises a 
Drosophila bithorax Fab-7 insulator. 

5 18. The transposon of claim 1 wherein at least one of the insulator elements comprises a 
mammalian insulator flanking a tyrosinase gene. 



19. The transposon of claim 1, wherein the transcriptional unit is disposed between a first 
insulator element and a second insulator element, and the first insulator element and the second 

1 0 insulator element are disposed between inverted repeats of a transposon. 

20. The transposon of claim 1, wherein the transcriptional unit further comprises at least one 
member of the group consisting of promoters and enhancers. 

15 21. The transposon of claim 1 wherein the exogenous nucleic acid encodes a marker 
molecule. 

22. The transposon of claim 1 wherein the exogenous nucleic acid is a member of the group 
consisting of DNA encoding an antisense RNA or siRNA. 

20 

23. The transposon of claim 1 wherein the exogenous nucleic acid is a member of the group 
consisting of DNA encoding an mRNA. 



25 



24. The transposon of claim 1 comprising at least two inverted repeat sequences that 
specifically bind to a Sleeping Beauty transposase. 
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25. The transposon of claim 1 further comprising a suicide sequence nucleic acid. 

26. The transposon of claim 13 further comprising an independent promoter for the suicide 
sequence nucleic acid. 

5 

27. A cell, the cell comprising the transposon of claim 1 . 

28. The cell of claim 27, wherein the cell is in vitro. 

10 29. The cell of claim 27, wherein the cell is in an animal. 

30. The cell of claim 27, wherein the cell is in a human. 

3 1 . The cell of claim 27, wherein the cell is transfected with the transposon. 

15 

32. The cell of claim 27, wherein the cell is electroporated with the transposon. 

33. The cell of claim 27, wherein the cell is microinjected with the transposon. 

20 34. The cell of claim 27, wherein the cell is electroporated or microinjected with the 
transposon, and the cell produces a protein that is encoded by the exogenous nucleic acid. 

35. The cell of claim 27, wherein the cell is transfected with the exogenous nucleic acid and 
produces a protein that is encoded by the exogenous nucleic acid. 

25 
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36. The cell of claim 27, wherein the cell is a member of the group consisting of 
lymphocytes, pancreatic cells, neural cells, muscle cells, and blood cells. 

37. The cell of claim 27, wherein the cell is a member of the group consisting of hepatocytes 
, hepatoma cells, primary hepatocytes and liver cells. ( 

38. The cell of claim 27 wherein the cell wherein the cell is a stem cell. 

39. The cell of claim 27, wherein the cell wherein the cell is a member of the group 
consisting of primary pancreatic cells and pancreatic stem cells. 

40. The cell of claim 27, wherein the cell wherein the cell is a member of the group 
consisting of primary hematopoietic cells and hematopoietic stem cells. 

4 1 . The cell of claim 3 2, wherein the protein is a marker. 

42. The cell of claim 32, wherein the protein is a therapeutic protein. 

43. The cell of claim 34, wherein the therapeutic protein ameliorates a medical condition. 

44. An animal, the animal comprising the transposon of claim 1 . 

45. The animal of claim 44 wherein the animal is a member of the group consisting of a 
zebrafish, a mouse, and a rat. 



46. An animal embryo, the embryo comprising the transposon of claim 1. 
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47. The animal embryo of claim 46 wherein the embryo is a member of the group consisting 
of a zebrafish, a mouse, and a rat. 

5 48. A method of altering a cell, the method comprising exposing the cell to a transposon that 
comprises a transcriptional unit and a plurality of insulator elements, wherein the transcriptional 
unit is flanked by at least one insulator element on each side of the transcriptional unit, wherein 
the transcriptional unit comprises an exogenous nucleic acid for introduction into a cell. 

10 49. The method of claim 48 wherein the transposon is introduced into the cell by 
electroporation. 

50. The method of claim 48 wherein the transposon is introduced into the cell by 
microinjection. 

15 

51. The method of claim 48 wherein the cell is a member of the group consisting of 
lymphocytes, pancreatic cells, neural cells, muscle cells, and blood cells. 

52. The method of claim 48 wherein the cell is a member of the group consisting of 
20 hepatocytes , hepatoma cells, primary hepatocytes and liver cells. 

53 . The method of claim 48 wherein the cell is a stem cell. 

54. The method of claim 48 wherein the cell is a member of the group consisting of primary 
25 pancreatic cells and pancreatic stem cells. 
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55. The method of claim 48 wherein the cell is a member of the group consisting of primary 
hematopoietic cells and hematopoietic stem cells. 

56. The method of claim 48, wherein the insulator element specifically binds to a CTCF 
5 protein. 

57. The method of claim 48, wherein the insulator element comprises a binding site for a 
CTCF protein. 

10 58. The method of claim 48, wherein the insulator element comprises a 16-base sequence 
that has at least 80% identity with SEQ ID NO: 16. 

59. The method of claim 48, wherein the insulator element comprises a 16-base sequence 
that has at least 80% identity with SEQ ID NO: 17. 

15 

60. The method of claim 48, wherein the insulator element comprises a 16-base sequence 
that has at least 80% identity with SEQ ID NO: 18. 

61 . The method of claim 48, wherein the insulator element comprises at least one member of 
20 the group consisting of SEQ ID NO: 16, SEQ ID NO: 17, and SEQ ID NO: 18. 

62. The method of claim 48, wherein at least one of the insulator elements is chosen from a 
member of the group consisting of scs/scs' elements from the Z>. melanogangster 87A7 heat 
shock locus, a gypsy moth retrotransposon sw(7* ^-binding region, an A-element, an A-element 

25 from a chicken lysozyme locus, a DHS5 site of a chicken p-globulin locus, a mammalian 
apolipoprotein B gene insulator, a human p-interferon gene insulator, an sns insulator from sea 
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urchin arylsuflatase gene, a Drosophila bithorax Fab-7 insulator, and a mammalian insulator 
flanking a tyrosinase gene. 

63. The method of claim 48, wherein the transcriptional unit is disposed between a first 
insulator element and a second insulator element, and the first insulator element and the second 
insulator element are disposed between at least two inverted repeats. 

64. The method of claim 48, wherein the transcriptional unit further comprises at least one 
member of the group consisting of promoters and enhancers. 

65. The method of claim 48 wherein the exogenous nucleic acid encodes a marker molecule. 

66. The method of claim 48 wherein the exogenous nucleic acid is a member of the group 
consisting of antisense DNA, DNA, and cDNA. 

67. The method of claim 48 wherein the exogenous nucleic acid encodes siRNA. 

68. The method of claim 48 comprising at least two inverted repeat sequences that 
specifically bind to a Sleeping Beauty transposase. 

69. The method of claim 48 farther comprising a suicide sequence nucleic acid. 

70. The method of claim 69 further comprising an independent promoter for the suicide 
sequence nucleic acid. 

7 1 . The method of claim 48 further comprising exposing a cell to the transposon. 
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72 . The method of claim 7 1 , wherein the cell is in vitro. 

73 . The method of claim 7 1 , wherein the cell is in an animal. 

5 

74. The method of claim 71, wherein the cell is in a hnman. 

75. The method of claim 71 5 wherein the cell is transfected with the exogenous nucleic acid. 

10 76. The method of claim 71 , wherein the cell is electroporated or microinjected with the 
transposon. 

77. The method of claim 71, wherein the cell is transfected with the exogenous nucleic acid 
and produces a protein that is encoded by the exogenous nucleic acid. 

15 

7 8 . The method of claim 77, wherein the protein is a marker. 

79. The method of claim 77, wherein the protein is a therapeutic protein. 

20 80. The method of claim 79, wherein the therapeutic protein ameliorates a medical condition. 

81. The method of claim 48 further comprising exposing a cell in an animal to the 
transposon. 

25 82. The method of claim 81 wherein the animal is a member of the group consisting of a 
zebrafish, a mouse, and a rat. 
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83 . The method of claim 8 1 wherein the animal is an emhryo. 

84. The method of claim 83 wherein the embryo is a member of the group consisting of a 
5 zebrafish, a mouse, and a rat. 

85. A transposon comprising a transcriptional unit and a means for preventing regulation of 
transcription of host nucleic acid by the transcriptional unit following insertion of into a host 
mammalian cell nuclear genome. 

10 

86. The transposon of claim 85 wherein the host nucleic acid is a gene. 

87. The transposon of claim 85 wherein the transcriptional unit comprises an exogenous 
nucleic acid. 

15 

88. The transposon of claim 85, wherein the means for preventing regulation of transcription 
of host nucleic acid specifically binds to a CTCF protein. 

89. The transposon of claim 85, wherein the means for preventing regulation of transcription 
20 of host nucleic acid comprises a binding site for a CTCF protein. 

90. The transposon of claim 85 ? wherein the transcriptional unit is disposed between a first 
insulator element and a second insulator element, and the first insulator element and the second 
insulator element are disposed between the at least two inverted repeats. 

25 
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91. The transposes of claim 85, wherein the transcriptional unit further comprises at least 
one member of the group consisting of promoters and enhancers. 

92. The transposon of claim 85 wherein the transcriptional unit encodes a marker molecule. 

93. A cell, the cell comprising the transposon of claim 85. 

94. The cell of claim 93, wherein the cell is in vitro. 

95 . The cell of claim 94, wherein the cell is in a human. 

96. An animal, the animal comprising the transposon of claim 85 . 

97. The animal embryo of claim 96 wherein the embryo is a member of the group consisting 
of a zebrafish, a mouse, and a rat. 

98. The transposon of claim 85, wherein the means for preventing regulation of transcription 
of host nucleic acid comprises a 16-base sequence that has at least 80% identity with SEQ ID 
NO:16 

99. The transposon of claim 85 9 wherein the means for preventing regulation of transcription 
of host nucleic acid comprises a 16-base sequence that has at least 80% identity with SEQ ID 
NO: 17 
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100. The transposon of claim 85, wherein the means for preventing regulation of transcription 
of host nucleic acid comprises a 16-base sequence that has at least 80% identity with SEQ ID 
NO:18. 

101 . The transposon of claim 85, wherein the means for preventing regulation of transcription 
of host nucleic acid comprises at least one member of the group consisting of SEQ ID NO: 16, 
SEQ ID NO: 17, and SEQ ID NO: 1 8 . 

102. The transposon of claim 85 wherein the transcriptional unit is a member of the group 
consisting of antisense DNA, DNA, and cDNA. 



WO 2004/065581 



PCT/US2004/000977 




SUBSTITUTE SHEET (RULE 26) 



WO 2004/065581 



PCT/US2004/000977 



2/3 



F/g. 3 A 

V A\ CELL DEATH 

114 WHEN ACTIVATED 



134 



132 





^-150 130 ^-242 ^-240 



-124 
i / ^122^ 120 



10 ^200 



160 



170 



J 




250 



114 



CELL DEATH 
WHEN ACTIVATED 



210 



134 



-132 ^-300 




-100 



170 



IRES-' J 

CELL DEATH 
WHEN ACTIVATED 



SUBSTITUTE SHEET (RULE 26) 



WO 2004/065581 



PCT/US2004/000977 



3/3 



Fig. 5 



-500 



•520 



510 




CCACTAG AGG G AAG AA 



TTCTGACCGCTAGTGG 



Fig. 6A 





SUBSTITUTE SHEET (RULE 26) 



